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OM protein - protein search, using sw model 



Run on: 



March 1, 2005, 08:40:57 ; Search time 23.793 Seconds 

(without alignments) 
1690.542 Million cell updates/sec 



Title: 



Sequence: 



US-10-624-932-2 COPY 495 598 



Perfect score: 559 



1 TSNMTYGTFNFLGGRLMIPN CGEPSPDSWSLRLKKQSCEG 104 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



2105692 seqs, 386760381 residues 



Total number of hits satisfying chosen parameters: 



2105692 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database 



A_Geneseq_16Dec04 : * 

1: geneseqpl980s : * 

2: geneseqpl990s : * 

3: geneseqp2000s : * 

4: geneseqp2001s : * 

5: geneseqp2002s : * 

6: geneseqp2003as : * 

7: geneseqp2003bs : * 

8: geneseqp2004s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


559 


100.0 


437 


4 


AAB50690 


Aab50690 Human 


2 


559 


100.0 


544 


7 


ADG42581 


Adg42581 Human 


3 


559 


100.0 


817 


8 


ADH71624 


Adh71624 Human 


4 


559 


100.0 


833 


8 


ADH71622 


Adh71622 Human 


5 


559 


100.0 


842 


5 


AAU74818 


Aau74818 Human 


6 


559 


100.0 


842 


8 


ADL06574 


Adl06574 Human 


7 


559 


100.0 


898 


5 


AAU85403 


Aau85403 Human 


8 


559 


100.0 


898 


5 


AAU97899 


Aau97899 Human 


9 


559 


100.0 


898 


8 


ADH71618 


Adh71618 Human 



HS1 
tra 
pro 
pro 
REP 
turn 
pro 
net 
pro 



10 


559 


100. 


0 


898 


8 


ADH71626 


Adh71626 


Human pro 


11 


559 


100.0 


899 


5 


AAU79939 


Aau79939 


Human UNC 


12 


559 


100. 


0 


899 


7 


ADG42569 


Adg42569 


Novel hum 


13 


559 


100. 


0 


899 


8 


ADH71636 


Adh71636 


Human pro 


14 


559 


100. 


0 


899 


8 


ADH71642 


Adh71642 


Human pro 


15 


559 


100. 


0 


899 


8 


ADH71648 


Adh71648 


Human pro 


16 


559 


100. 


0 


899 


8 


ADH71610 


Adh71610 


Human pro 


17 


559 


100. 


0 


899 


8 


ADH71628 


Adh71628 


Human pro 


18 


559 


100. 


0 


899 


8 


ADH71640 


Adh7164 0 


Human pro 


19 


559 


100. 


0 


899 


8 


ADH71630 


Adh71630 


Human pro 


20 


559 


100. 


0 


899 


8 


ADH71650 


Adh71650 


Human pro 


21 


559 


100. 


0 


899 


8 


ADH71644 


Adh71644 


Human pro 


22 


559 


100. 


0 


899 


8 


ADH71646 


Adh7164 6 


Human pro 


23 


559 


100. 


0 


899 


8 


ADH71638 


Adh71638 


Human pro 


24 


555 


99. 


3 


436 


4 


AAM25589 


Aam25589 


Human pro 


25 


555 


99. 


3 


899 


8 


ADH71632 


Adh71632 


Human pro 


26 


554 


99. 


1 


899 


8 


ADH71634 


Adh71634 


Human pro 


27 


553 


98. 


9 


556 


2 


AAW78899 


Aaw78899 


Human UNC 


28 


553 


98. 


9 


898 


2 


AAW78898 


Aaw78898 


Rat UNC-5 


29 


553 


98. 


9 


898 


5 


AAU10543 


Aaul0543 


Rat netri 


30 


553 


98. 


9 


898 


5 


AAU97900 


Aau97900 


Rat netri 


31 


548 


98. 


0 


898 


7 


ADG42580 


Adg42580 


Rat trans 


32 


544 


97. 


3 


400 


7 


ADN95115 


Adn95115 


Human LEC 


33 


465.5 


83. 


3 


943 


4 


AAM79128 


Aam79128 


Human pro 


34 


347.5 


62. 


2 


266 


4 


AAB50650 


Aab50650 


Human UNC 


35 


345 


61. 


7 


931 


7 


ADG42582 


Adg42582 


Mouse tra 


36 


337 


60. 


3 


526 


4 


AAB50648 


Aab50648 


Human UNC 


37 


337 


60. 


3 


931 


4 


AAB50691 


Aab50691 


Human UNC 


38 


337 


60. 


3 


931 


7 


ADE63098 


Ade63098 


Human Pro 


39 


337 


60. 


3 


931 


7 


ABU64297 


Abu64297 


Human thr 


40 


337 


60. 


3 


931 


8 


ADR99258 


Adr99258 


Human unc 


41 


337 


60. 


3 


964 


8 


ADR99250 


Adr99250 


Human 1RO 


42 


337 


60. 


3 


982 


4 


ABG11551 


Abgll551 


Novel hum 


43 


334 


59. 


7 


931 


7 


ADG42584 


Adg42584 


Human tra 


44 


322.5 


57. 


7 


929 


7 


ADG42583 


Adg42583 


Human tra 


45 


318 


56.9 


238 


4 


AAB50646 


Aab50646 


Human UNC 



ALIGNMENTS 



RESULT 1 
AAB50690 

ID AAB50690 standard; protein; 437 AA. 
XX 

AC AAB50690; 
XX 

DT 19-MAR-2001 (first entry) 
XX 

DE Human HS1 protein SEQ ID NO: 89. 
XX 

KW Human; Caenorhabditis elegans; UNC-5; splice variant; nematode worm; 

KW protein-protein interaction; identification. 

XX 

OS Homo sapiens . 
XX 

PN WO200073328-A2. 



XX 

PD 07-DEC-2000. 
XX 

PF 02-JUN-2000; 2000WO-EP005108 . 
XX 

PR 01-JUN-1999; 99GB-00012755 . 
XX 

PA (DEVG-) DEVGEN NV. 
XX 

PI Van Criekinge W, Roelens I, Bogaert T, Verwaerde P; 
XX 

DR WPI; 2001-016508/02. 
XX 

PT Three variants of human unc-5C cDNAs (unc-5Cb, unc-5Cc and unc-5C8) and a 

PT human unc-5HSl cDNA, useful in yeast two hybrid experiments for 

PT identifying unknown human cDNAs which encode proteins that interact with 

PT the human unc~5C protein. 

XX 

PS Disclosure; Page 223-224; 246pp; English. 
XX 

CC The present invention describes 3 variants of human unc-5C cDNAs (unc- 

CC 5Cb, unc-5Cc and unc-5C8) which correspond to alternatively spliced unc- 

CC 5C transcripts , and a human unc-5HSl cDNA which shares homology with the 

CC Rattus norvegicus unc-5HSl cDNA. Also described are assays based on 

CC protein-protein-interactions between the unc-5 protein and a variety of 

CC different interacting proteins. The unc-5C variant cDNAs and unc-5HSl 

CC cDNA are useful in methods for identifying compounds which reduce or 

CC inhibit the lethal phenotype associated with the expression of the unc-5 

CC death domain in yeast. They are also useful in yeast two hybrid 

CC experiments for identifying unknown human cDNAs which encode proteins 

CC that interact with the human unc-5C protein. AAC90914 to AAC90971 and 

CC AAB50646 to AAB50693 represent sequences used in the exemplification of 

CC the present invention 

XX 

SQ Sequence 437 AA; 

Query Match 100.0%; Score 559; DB 4; Length 437; 
Best Local Similarity ' 100.0%; Pred. No. 1.2e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 34 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 93 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 94 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 137 



RESULT 2 
ADG42581 

ID ADG42581 standard; protein; 544 AA. 
XX 

AC ADG42581; 
XX 

DT 26-FEB-2004 (first entry) 
XX 



DE Human transmembrane receptor UncSHl homologue. 
XX 

KW cytostatic; gene therapy; NOVX-agonist ; NOVX-antagonist ; pharmaceutical; 

KW NOVX-associated disorder; cancer; human; transmembrane receptor; 

KW UncSHl homologue. 
XX 

OS Homo sapiens. 
XX 

PN US2003204052-A1. 
XX 

PD 30-OCT-2003. 
XX 

PF 04-OCT-2001; 2001US-00970944 . 
XX 

PR 04-OCT-2000; 2000US-0237862P . 
XX 

PA (HERR/) HERRMANN J L. 

PA (RAST/) RASTELLI L. 

PA (SHIM/) SHIMKETS R A. 
XX 

PI Herrmann JL, Rastelli L, Shimkets RA; 
XX 

DR WPI; 2003-900673/82. 
XX 

PT New NOVX gene or NOVX-specif ic antibody, useful for preparing a 

PT composition for treating or preventing a NOVX-associated disorder, e.g., 

PT cancer. 

XX 

PS Disclosure; SEQ ID NO 14; 118pp; English. 
XX 

CC The invention describes a new isolated polypeptide comprising: a 

CC polypeptide or its mature form comprising a sequence not given in the 

CC specification; or a variant of (A) , where one or more amino acid residues 

CC in the variant differs in no more than 15% from the amino acid sequence 

CC of the mature form. The pharmaceutical composition may be administered 

CC via oral, transdermal, rectal or parenteral route. The polypeptide, 

CC nucleic acid or antibody is useful for preparing a composition for 

CC treating or preventing a NOVX-associated disorder, e.g., cancer. This is 

CC the amino acid sequence of a transmembrane receptor homologue used in a 

CC comparison with the novel human proteins of the invention. 

XX 

SQ Sequence 544 AA; 

Query Match 100.0%; Score 559; DB 7; Length 544; 
Best Local Similarity 100.0%; Pred. No. 1.6e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 141 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 200 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I III II I I I I I III I I I I I I I I I I I I I I I I I II I II I I I II I II 

Db 201 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 244 



RESULT 3 



ADH71624 




ID 


ADH71624 standard; protein; 817 AA. 


XX 






AC 


ADH71624; 




XX 






DT 


25-MAR-2004 


(first entry) 


XX 






DE 


Human protein of the invention NOV21h SEQ ID NO: 520. 


XX 






KW 


human; cytostatic; immunomodulator ; neuroprotective; nootropic; 


KW 


anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 


KW 


vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease 


KW 


obesity; diabetes; infectious disease; metabolic syndrome X; 


KW 


dyslipidaemia . 


XX 






OS 


Homo sapiens . 


XX 






PN 


WO2003102155-A2. 


XX 






PD 


ll-DEC-2003. 




XX 






PF 


03-JUN-2003; 2003WO-US017430 . 


XX 






PR 




200211^-0^8^1 20P 


PR 


04- JUN-2002 . 


2002US-03857 84P 


PR 


05- JUN-2002 . 


2002US-0386041P 


PR 


05- JUN-2002 f 


2002US-0386047P. 


PR 


06- JUN-2002, 


• 2002US-0386376P . 


PR 


06- JUN-2002 , 


' 2002US-0386453P. 


PR 


06- JUN-2002 , 


• 2002US-0386864P. 


PR 


06-JUN-2002, 


• 2002US-0387016P. 


PR 


07-JUN-2002, 


• 2002US-0386796P. 


PR 


07-JUN-2002, 


• 2002US-0386816P. 


PR 


07-JUN-2002, 


• 2002US-0386931P. 


PR 


07-JUN-2002, 


• 2002US-0386942P. 


PR 


07- JUN-2002, 


r 2002US-0386971P. 


PR 


07-JUN-2002, 


; 2002US-0387262P. 


PR 


08-JUN-2002, 


; 2002US-0296960P. 


PR 


10-JUN-2002, 


;' 2002US-0387400P. 


PR 


10-JUN-2002, 


I 2002US-0387535P. 


PR 


ll-JUN-2002, 


? 2002US-0387610P. 


PR 


ll-JUN-2002, 


; 2002US-0387625P. 


PR 


ll-JUN-2002, 


; 2002US-0387634P. 


PR 


ll-JUN-2002, 


j 2002US-0387668P. 


PR 


11- JUN-2002, 


? 2002US-0387696P. 


PR 


ll-JUN-2002, 


? 2002US-0387702P. 


PR 


ll-JUN-2002, 


; 2002US-0387836P. 


PR 


ll-JUN-2002, 


j 2002US-0387859P. 


PR 


12-JUN-2002, 


; 2002US-0387933P. 


PR 


12-JUN-2002, 


; 2002US-0387934P. 


PR 


12-JUN-2002 


? 2002US-0387960P. 


PR 


12-JUN-2002 


? 2002US-0388022P. 


PR 


12-JUN-2002 


; 2002US-0388096P. 


PR 


13-JUN-2002 


; 2002US-0389123P. 


PR 


14-JUN-2002 


; 2002US-0389118P. 


PR 


14-JUN-2002 


; 2002US-0389120P. 


PR 


14-JUN-2002 


? 2002US-0389144P. 



PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
PI 
PI 
PI 
PI 
PI 
PI 
PI 
PI 
XX 
DR 
DR 



14 


-JUN- 


•2002, 


; 2002US- 


0389146P. 


17 


-JUN- 


•2002 


- 2002US- 


0389729P. 


17 


-JUN- 


-2002, 


? 2002US- 


0389742P. 


18 


-JUN- 


•2002, 


; 2002US- 


0389884P. 


19 


-JUN- 


•2002, 


; 2002US- 


0390006P. 


19 


-JUN- 


•2002, 


; 2002US- 


0390209P. 


21 


-JUN- 


•2002, 


? 2002US- 


0390763P. 


17 


-JUL- 


•2002, 


f 2002US- 


0396706P. 


06 


-AUG- 


2002, 


; 2002US- 


0401628P. 


09 


-AUG- 


•2002, 


; 2002US- 


0402156P. 


09 


-AUG- 


•2002, 


P 2002US- 


0402256P. 


09 


-AUG- 


•2002, 


? 2002US- 


0402389P. 


12 


-AUG- 


2002, 


; 2002US- 


0402786P. 


12 


-AUG- 


2002, 


? 2002US- 


0402816P. 


12 


-AUG- 


•2002, 


; 2002US- 


0402821P. 


12 


-AUG- 


2002, 


; 2002US- 


0402832P. 


13 


-AUG- 


2002, 


? 2002US- 


0403448P. 


13 


-AUG- 


2002, 


; 2002US- 


0403459P. 


13 


-AUG- 


2002, 


; 2002US- 


0403531P. 


13 


-AUG- 


•2002, 


; 2002US- 


0403532P. 


13 


-AUG- 


2002, 


; 2002US- 


0403563P. 


13 


-AUG- 


2002, 


; 2002US- 


0406317P. 


15 


-AUG- 


2002, 


; 2002US- 


0403617P. 


26 


-AUG- 


2002, 


; 2002US- 


0406182P. 


26 


-AUG- 


2002, 


; 2002US- 


0406355P. 


27 


-AUG- 


•2002, 


; 2002US- 


0406240P. 


12 


-SEP- 


2002, 


; 2002US- 


0410084P. 


20 


-SEP- 


•2002, 


? 2002US- 


0412528P. 


23 


-SEP- 


•2002, 


; 2002US- 


0412731P. 


30 


-SEP- 


2002, 


? 2002US- 


0414801P. 


30 


-SEP- 


2002 


; 2002US- 


0414839P. 


30 


-SEP- 


2002 


; 20Q2US- 


0414840P. 


30 


-SEP- 


2002, 


; 2002US- 


0414954P. 


09 


-OCT- 


•2002, 


; 2002US- 


0417186P. 


09 


-OCT- 


2002, 


; 2002US- 


0417406P. 


23 


-OCT- 


•2002, 


; 2002US- 


0420639P. 


28 


-OCT- 


2002, 


; 2002US- 


0421156P. 


31 


-OCT- 


•2002, 


; 2002US- 


0422690P. 


01 


-NOV- 


2002, 


; 2002US- 


0423130P. 


05 


-NOV- 


2002, 


? 2002US- 


00423798. 


05 


-NOV- 


•2002, 


\ 2002US- 


0423798P. 


12 


-NOV- 


•2002, 


; 2002US- 


0425453P. 



(CURA-) CURAGEN CORP. 

Alsobrook JP, TVlvarez E, Anderson DW / Boldog FL, Casman SJ; 
Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR f Ellerman K; 
Ettenberg S, Gangolli EA, Gerlach VL, Gorman I>, Gunther E, Guo X; 
Gusev VY, Herrmann JL, Ji W, Kekuda R f Li L, Liu X, Macdougall JR; 
Maclachlan T f Malyankar UM, Mezick AJ f Millet I, Mishra VS; 
Padigaru M f Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L; 
Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA; 
Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M; 
Zhong H; 

WPI; 2004-081935/08. 
N-PSDB; ADH71623. 



XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 
XX 

PS Example 21; SEQ ID NO 520; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 817 AA; 

Query Match 100.0%; Score 559; DB 8; Length 817; 

Best Local Similarity 100.0%; Pred. No. 2.7e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 414 TSNMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 473 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 474 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 517 



RESULT 4 
ADH71622 

ID ADH71622 standard; protein; 833 AA. 
XX 

AC ADH71622; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV21g SEQ ID NO: 518. 
XX 

KW human; cytostatic; immunomodulator; neuroprotective; nootropic; 

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO2003102155-A2. 



XX 








PD 


11 


-DEC- 


-2003 


XX 








PF 


03 


-JUN- 


•2003 


XX 








PR 


03 


-JUN- 


2002 


PR 


04 


-JUN- 


2002 


PR 


05 


-JUN- 


2002 


PR 


05 


-JUN- 


2002 


PR 


06 


-JUN- 


2002 


PR 


06 


-JUN- 


2002 


PR 


06 


-JUN- 


2002 


PR 


06 


-JUN- 


2002 


PR 


07 


-JUN- 


2002 


PR 


07 


-JUN- 


2002 


PR 


07 


-JUN- 


2002 


PR 


07 


-JUN- 


2002 


PR 


07 


-JUN- 


2002 


PR 


07 


-JUN- 


2002 


PR 


08 


-JUN- 


2002 


PR 


10 


-JUN- 


-2002 


PR 


10 


-JUN- 


2002 


PR 


11 


-JUN- 


2002 


PR 


11 


-JUN- 


2002 


PR 


11 


-JUN- 


2002 


PR 


11 


-JUN- 


2002 


PR 


11 


-JUN- 


2002 


PR 


11 


-JUN- 


2002 


PR 


11 


-JUN- 


•2002 


PR 


11 


-JUN- 


•2002 


PR 


12 


-JUN- 


2002 


PR 


12 


-JUN- 


2002 


PR 


12 


-JUN- 


2002 


PR 


12 


-JUN- 


2002 


PR 


12 


-JUN- 


2002 


PR 


13 


-JUN- 


2002 


PR 


14 


-JUN- 


-2002 


PR 


14 


-JUN- 


-2002 


PR 


14 


-JUN- 


•2002 


PR 


14 


-JUN- 


-2002 


PR 


17 


-JUN- 


2002 


PR 


17 


-JUN- 


•2002 


PR 


18 


-JUN- 


■2002 


PR 


19 


-JUN- 


■2002 


PR 


19 


-JUN- 


■2002 


PR 


21 


-JUN- 


-2002 


PR 


17 


-JUL- 


•2002 


PR 


06 


-AUG- 


•2002 


PR 


09 


-AUG- 


•2002 


PR 


09 


-AUG- 


•2002 


PR 


09 


-AUG- 


•2002 


PR 


12 


-AUG- 


•2002 


PR 


12 


-AUG-2.002. 


PR 


12 


-AUG- 


-2002 


PR 


12 


-AUG- 


-2002 


PR 


13 


-AUG- 


-2002 


PR 


13 


-AUG- 


-2002 



2003WO-US017430. 

2002US-0385120P. 
2002US-0385784P. 
2002US-0386041P. 
2002US-0386047P. 
2002US-0386376P. 
2002US-0386453P. 
2002US-0386864P. 
2002US-0387016P. 
2002US-0386796P. 
2002US-0386816P. 
2002US-0386931P. 
2002US-0386942P. 
2002US-0386971P. 
2002US-0387262P. 
2002US-0296960P. 
2002US-0387400P. 
2002US-0387535P. 
2002US-0387610P. 
2002US-0387625P. 
2002US-0387634P. 
2002US-0387668P. 
2002US-0387696P. 
2002US-0387702P. 
2002US-0387836P. 
2002US-0387859P. 
2002US-0387933P. 
2002US-0387934P. 
2002US-0387960P. 
2002US-0388022P. 
2002US-0388096P. 
2002US-0389123P. 
2002US-0389118P. 
2002US-0389120P. 
2002US-0389144P. 
2002US-0389146P. 
2002US-0389729P. 
2002US-0389742P. 
2002US-0389884P. 
2002US-0390006P. 
2002US-0390209P. 
2002US-0390763P. 
2002US-0396706P. 
2002US-0401628P. 
2002US-0402156P. 
2002US-0402256P. 
2002US-0402389P. 
2002US-0402786P. 
20O2US-04.02816P. 
2002US-0402821P. 
2002US-0402832P. 
2002US-0403448P. 
2002US-0403459P. 



PR 13-AUG-2002; 2002US-0403531P . 

PR 13-AUG-2002; 2002US-04 03532P . 

PR 13-AUG-2002; 2002US-0403563P . 

PR 13-AUG-2002; 2002US-0406317P . 

PR 15-AUG-2002; 2002US-0403617P . 

PR 26-AUG-2002; 2002US-0406182P . 

PR 26-AUG-2002; 2002US-0406355P . 

PR 27-AUG-2002; 2002US-0406240P . 

PR 12-SEP-2002; 2002US-0410084P . 

PR 20-SEP-2002; 2002US-0412528P . 

PR 23-SEP-2002; 2002US-0412731P . 

PR 30-SEP-2002; 2002US-04 14801P . 

PR 30-SEP-2002; 2002US-0414839P . 

PR 30-SEP-2002; 2002US-0414840P . 

PR 30-SEP-2002; 2002US-0414954P . 

PR 09-OCT-2002; 2002US-0417186P . 

PR 09-OCT-2002; 2002US-0417406P . 

PR 23-OCT-2002; 2002US-0420639P . 

PR 28-OCT-2002; 2002US-0421156P . 

PR 31-OCT-2002; 2002US-0422690P . 

PR 01-NOV-2002; 2002US-0423130P . 

PR 05-NOV-2002; 2002US-00423798 . 

PR 05-NOV-2002; 2002US-0423798P . 

PR 12-NOV-2002; 2002US-0425453P . 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ; 

PI Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR, Ellerman K; 

PI Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X; 

PI Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR; 

PI Maclachlan T f Malyankar UM f Mezick AJ, Millet I, Mishra VS; 

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L; 

PI Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA; 

PI Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M; 

PI Zhong H; 

XX 

DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71621. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 
XX 

PS Example 21; SEQ ID NO 518; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulator , 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson^ disease, obesity, diabetes, infectious 



CC diseases , metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics . The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 833 AA; 

Query Match 100.0%; Score 559; DB 8; Length 833; 

Best Local Similarity 100.0%; Pred. No. 2.7e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTET^FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVTILPLAGCQTL 60 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 430 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 489 

Qy 61 LSPIVSCGPPGVLLTRPVTLAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 490 LSPIVSCGPPGVLLTRPVTLAMDHCGEPSPDSWSLRLKKQSCEG 533 



RESULT 5 
AAU74818 

ID AAU74818 standard; protein; 842 AA. 
XX 

AC AAU74818; 
XX 

DT 23-APR-2002 (first entry) 
XX 

DE Human REPTR 1 protein. 
XX 

KW REPTR; human; antiinflammatory; cytostatic; immunosuppressive; antiviral; 

KW anti-HIV; antiarthritic; anticonvulsant; nootropic; neuroprotective; 

KW antiallergic; antibody; immunogen; endometriosis; 

KW gastrointestinal disorder; gastritis; oesophageal carcinoma; 

KW Crohn's disease; irritable bowel syndrome; ulcerative colitis; 

KW endocrine disorder; hypothalamus disorder; Kallman's disease; 

KW autoimmune disease; inflammatory disease; infertility; receptor; 

KW acquired immune deficiency syndrome; AIDS; rheumatoid arthritis; allergy; 

KW osteoarthritis; diabetes mellitus; multiple sclerosis; 

KW systemic lupus erythematosus; cell proliferative disorder; cancer; 

KW developmental disorder; Duchenne muscular dystrophy; 

KW Becker muscular dystrophy; neurological disorder; epilepsy; 

KW Alzheimer's disease; Huntington's disease; reproductive disorder. 

XX 

OS Homo sapiens . 
XX 

PN WO200198354-A2. 
XX 

PD 27-DEC-2001. 
XX 

PF 21-JUN-2001; 2001WO-US019942 . 
XX 

PR 21-JUN-2000; 2000US-0214027-P. 

PR 25-AUG-2000; 2000US-0228045P . 

PR 12-DEC-2000; 2000US-0255104P. 
XX 

PA (INCY-) INCYTE GENOMICS INC. 



XX 

PI Griffin JA, Kallick DA, Tribouley CM, Yue H, Nguyen DB, Tang YT; 

PI Lai P, Policky JL, Azimzai Y, Lu DAM, Graul R, Yao MG, Bur ford N; 

PI Hafalia AJA, Baughn MR, Bandman 0, Patterson C, Yang J, Xu Y; 

PI Gandhi AR, Warren BA, Ding L, Sanjanwala MS, Duggan BM, Lu Y; 
XX 

DR WPI; 2002-090432/12. 

DR N-PSDB; ABK15169. 
XX 

PT Twelve human receptors (referred to as REPTR-1 to REPTR-12) , useful in 

PT the diagnosis, treatment and prevention of gastrointestinal (e.g. 

PT gastritis), autoimmune/inflammatory (e.g. osteoarthritis) and cell 

PT proliferative (e.g. cancer ) disorders . 
XX 

PS Claim 45; Page 111-113; 157pp; English. 
XX 

CC This invention relates to twelve human receptors cDNA sequences referred 

CC to as REPTR-1 to REPTR-12), and the proteins encoded thereby. The 

CC proteins of the invention may have antiinflammatory, cytostatic, 

CC immunosuppressive, antiviral, anti-HIV, antiarthritic, muscular active 

CC general, anticonvulsant, nootropic, neuroprotective, antiallergic 

CC activities. The sequences of the invention may be used to produce REPTR 

CC agonists or antagonists, and the protein sequences may be used to raise 

CC anti-REPTR antibodies . These molecules and the REPTR polynucleotides and 

CC polypeptides of the invention are useful in the diagnosis , treatment and 

CC prevention of gastrointestinal (e.g. gastritis, oesophageal carcinoma, 

CC Crohn's disease, irritable bowel syndrome, ulcerative colitis), endocrine 

CC (e.g. hypothalamus disorder, Kallman's disease), autoimmune/ inflammatory 

CC (e.g. acquired immune deficiency syndrome (AIDS) , rheumatoid arthritis, 

CC allergies, osteoarthritis, diabetes mellitus, multiple sclerosis, 

CC systemic lupus erythematosus), cell proliferative (e.g. cancer), 

CC developmental (e.g. Duchenne and Becker muscular dystrophy), neurological 

CC (e.g. epilepsy, Alzheimer's disease, Huntington's disease) and 

CC reproductive (e.g. infertility, endometriosis) disorders. Numerous other 

CC examples of each disorder are given in the specification. The present 

CC sequence represents the human REPTR1 protein sequence of the invention 

XX 

SQ Sequence 842 AA; 

Query Match 100.0%; Score 559; DB 5; Length 842; 

Best Local Similarity 100.0%; Pred. No. 2.8e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 439 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 4 98 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I II 1 I I I I I I I I I I I I I I I I 
Db 499 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 542 

RESULT 6 
ADL06574 

ID ADL06574 standard; protein; 842 AA. 
XX 

AC ADL06574; 



XX 

DT 20-MAY-2004 (first entry) 
XX 

DE Human tumour-associated antigenic target (TAT) polypeptide #73. 
XX 

KW Human; tumour-associated antigenic target; TAT; cell death; tumour; 

KW cancer; cytostatic. 
XX 

OS Homo sapiens. 
XX 

PN WO2004016225-A2. 
XX 

PD 26-FEB-2004. 
XX 

PF 19-AUG-2003; 2003WO-US025892 . 
XX 

PR 19-AUG-2002; 

PR 21-AUG-2002; 

PR 23-SEP-2002; 

PR 15-OCT-2002; 

PR 15-NOV-2002; 

PR 02-JUL-2003; 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Desauvage FJ, Frantz G, Hillan KJ, Polakis P, Poison A, Smith V; 

PI Spencer SD, Wu TD, Zhang Z; 

XX 

DR WPI; 2004-257144/24. 

DR N-PSDB; ADL06497. 
XX 

PT New antibody that binds to a tumor-associated antigenic target (TAT) 

PT polypeptide, useful for preparing a composition for diagnosing or 

PT treating cancer. 
XX 

PS Claim 2; SEQ ID NO 154; 319pp; English. 
XX 

CC The present invention relates to the isolation of human tumour-associated 

CC antigenic target (TAT) polynucleotide and polypeptide sequences. Also 

CC disclosed is an antibody that binds to a TAT polypeptide. The antibody is 

CC a monoclonal antibody, an antibody fragment, a chimeric antibody or a 

CC humanised antibody. It is conjugated to a growth inhibitory agent. It is 

CC produced in bacteria or in CHO cells and induces death of a cell to which 

CC it binds . The antibody is useful for preparing a composition for 

CC diagnosing or treating tumours and cancer. The present sequence 

CC represents a human TAT polypeptide of the invention. 

XX 

SQ Sequence 842 AA; 



2002US-0404809P. 
2002US-0405645P. 
2002US-0413192P. 
2002US-0419008P. 
2002US-0426847P. 
2003US-0484959P. 



Query Match 100.0%; Score 559; DB 8; Length 842; 

Best Local Similarity 100.0%; Pred. No. 2.8e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 

Db 



1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
439 TSNMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 498 



Qy 
Db 



61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 
I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
499 LSPIVSCGPPGVLLTRPVT LAMDHCGEPSPDSWSLRLKKQSCEG 542 



RESULT 7 
AAU85403 

ID AAU85403 standard; protein; 898 AA. 
XX 

AC AAU85403; 
XX 

DT 21-MAY-2002 (first entry) 
XX 

DE Human protein NOV1. 
XX 

KW Human; NOVX; cardiomyopathy; atherosclerosis; diabetes; 

KW cell signal processing disorder; metabolic disorder; obesity; infection; 

KW anorexia; cancer-associated cachexia; cancer; neurodegenerative disorder; 

KW Alzheimer's disease; Parkinson's disease; immune disorder; 

KW haematopoietic disorders; dyslipidaemia; pain; asthma; hypertension; 

KW osteoporosis; Crohn's disease; multiple sclerosis; angina pectoris; 

KW myocardial infarction; ulcer; allergy; benign prostatic hypertrophy; 

KW psychosis; neurological disorder; anxiety; schizophrenia; 

KW manic depression; dementia; dyskinesia; Huntington's disease; 

KW Gilles de la Tourette ' s syndrome; gene therapy. 

XX 

OS Homo sapiens . 
XX 

PN WO200210216-A2. 
XX 

PD 07-FEB-2002. 
XX 

PF 30-JUL-2001; 2001WO-US024225 . 
XX 

PR 28-JUL-2000; 2000US-0221409P . 

PR 04-AUG-2000; 2000US-0222840P . 

PR 04-AUG-2000; 2000US-0223752P . 

PR 04-AUG-2000; 2000US-0223762P . 

PR 04-AUG-2000; 2000US-0223769P . 

PR 04-AUG-2000; 2000US-0223770P . 

PR 14-AUG-2000; 2000US-022514 6P . 

PR 15-AUG-2000; 2000US-0225392P . 

PR 15-AUG-2000; 2000US-0225470P . 

PR 16-AUG-2000; 2000US-0225697P. 

PR 01-FEB-2001; 2001US-0263662P . 

PR 05-APR-2001; 2001US-0281645P . 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Padigaru M, Mezes P, Mishra V, Burgess C, Casman S, Grosse WM; 

PI Alsobrook JP, Lepley DM, Gerlach VX, Macdougall JR, Smithson G; 
XX 

DR WPI; 2002-180074/23. 

DR N-PSDB; ABK37922. 
XX 

PT New isolated cytoplasmic, nuclear, membrane bound, or secreted 

PT polypeptide, useful for treating cardiomyopathy, atherosclerosis, 



PT infections, cancer, neurodegenerative, metabolic, hematopoietic and 

PT immune disorders . 

XX 

PS Claim 1; Page 11; 213pp; English. 
XX 

CC The invention relates to an isolated cytoplasmic, nuclear, membrane 

CC bound, or secreted polypeptide (NOVX, x= 1-14) their variants or mature 

CC form. Also included are the nucleic acids encoding the NOVX proteins , a 

CC vector comprising the nucleic acid, a cell comprising the vector, an anti 

CC -NOVX antibody and modulators of NOVX. NOVX, the nucleic acid and the 

CC antibody are useful for treating or preventing a NOVX-associated 

CC disorder, where the disorder is selected from cardiomyopathy, 

CC atherosclerosis, diabetes, a disorder related to cell signal processing 

CC and metabolic pathway modulation, metabolic disorders, obesity, 

CC infectious disease, anorexia, cancer-associated cachexia, cancer, 

CC neurodegenerative disorders, Alzheimer's disease, Parkinson's disease, 

CC immune disorders, haematopoietic disorders, and the various 

CC dyslipidaemias, metabolic disturbances associated with obesity, the 

CC metabolic syndrome X and wasting disorders associated with chronic 

CC diseases, bacterial, fungal, protozoal and viral infections, pain, 

CC bulimia, asthma, hypertension, urinary retention, osteoporosis, Crohn's 

CC disease, multiple sclerosis, Albright Hereditary Osteodystrophy, angina 

CC pectoris, myocardial infarction, ulcer, allergy, benign prostatic 

CC hypertrophy, and psychotic and neurological disorders, including anxiety, 

CC schizophrenia, manic depression, delirium, dementia, and dyskinesias, 

CC such as Huntington's disease and Gilles de la Tourette ' s syndrome. The 

CC nucleic acid is useful in gene therapy. The present sequence represents a 

CC NOVX protein 

XX 

SQ Sequence 898 AA; 

Query Match 100.0%; Score 559; DB 5; Length 898; 

Best Local Similarity 100.0%; Pred. No. 3e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL. 60 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 495 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 555 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 8 
AAU97899 

ID AAU97899 standard; protein; 898 AA. 
XX 

AC AAU97899; 
XX 

DT 27-AUG-2002 (first entry) 
XX 

DE Human netrin binding membrane receptor UNC5H-1 protein. 
XX 

KW Netrin binding membrane receptor; receptor; UNC5H-1; human; nootropic; 

KW neuroprotective; cytostatic; antiparkinsonian; cerebroprotective; cancer; 

KW central nervous system; CNS; stroke; Parkinson's disease; 



KW 


multiple sclerosis; Alzheimer's disease. 


XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Location/Qualifiers 


FT 


Domain 


152. .223 


FT 




/note= "Immunoglobulin domain " 


FT 


Domain 


247. .294 


FT 




/note= "Thrombospondine type 1 domain 


FT 


Domain 


302. .348 


FT 




/note= "Thrombospondine type 1 domain 


FT 


Region 


361. .382 


FT 




/note= "Transmembrane region" 


FT 


Domain 


495. .598 


FT 




/note= "ZU5 domain" 


FT 


Domain 


817. .897 


FT 




/note= "Death domain" 



XX 

PN WO200233080-A2. 
XX 

PD 25-APR-2002. 
XX 

PF 15-OCT-2001; 2001WO-EP011891 . 
XX 

PR 16-OCT-2000; 2000US-0240061P . 
XX 

PA (FARB ) BAYER AG. 
XX 

PI Koehler RH; 
XX 

DR WPI; 2002-463314/49. 

DR N-PSDB; ABK52891. 
XX 

PT Novel human netrin binding membrane receptor polypeptide and 

PT polynucleotides for identifying modulating agents useful in treating 

PT diseases e.g. Parkinson's disease, multiple sclerosis, stroke, 

PT Alzheimer's disease. 

XX 

PS Claim 1; Fig 2; 94pp; English. 
XX 

CC This invention relates to the DNA and protein sequences of a novel 

CC purified human netrin binding membrane receptor, UNC5H-1 . The DNA 

CC sequence of the invention is useful as a probe for detecting a nucleic 

CC acid encoding the UNC5H-1 protein in a biological sample. The sequences 

CC of the invention are useful to screen for agents which decrease the 

CC activity of the UNC5H-1 protein. The sequences are also useful for 

CC screening agents which regulate (modulate) the activity of the protein of 

CC the invention. A pharmaceutical composition containing the protein of the 

CC invention or a reagent that modulates the activity of the UNC5H-1 protein 

CC may be useful for treating a UNC5H-1 dysfunction related disease such as 

CC cancer or a central nervous system (CNS) disorders (e.g, Parkinson's 

CC disease, multiple sclerosis, stroke and Alzheimer's disease). Fusion 

CC proteins comprising the UNC5H-1 protein are useful for generating 

CC antibodies and for in various assay systems, and the protein can be used 

CC as a bait protein in a two-hybrid assay or three-hybrid assay. The method 

CC of the invention is useful for detecting a coding sequence for the UNC5H- 

CC 1 protein. The present sequence represents the human netrin binding 



CC membrane receptor UNC5H-1 protein of the invention 
XX 

SQ Sequence 898 AA; 



Query Match 100.0%; Score 559; DB 5; Length 898; 

Best Local Similarity 100.0%; Pred. No. 3e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 495 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 555 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 9 
ADH71618 

ID ADH71618 standard; protein; 898 AA. 
XX 

AC ADH71618; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV21e SEQ ID NO: 514. 
XX 

KW human; cytostatic; immunomodulator ; neuroprotective; nootropic; 

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia . 

XX 

OS Homo sapiens . 
XX 

PN WO2003102155-A2 . 
XX 



PD 


11- 


DEC- 


2003. 






XX 












PF 


03- 


JUN- 


2003; 


2003WO- 


US017430. 


XX 












PR 


03- 


JUN- 


2002; 


2002US- 


0385120P. 


PR 


04- 


JUN- 


2002; 


2002US- 


0385784P. 


PR 


05- 


JUN- 


2002; 


2002US- 


0386041P. 


PR 


05- 


JUN- 


2002; 


2002US- 


0386047P. 


PR 


06- 


JUN- 


2002; 


2002US- 


0386376P. 


PR 


06- 


JUN- 


2002; 


2002US- 


0386453P. 


PR 


06- 


JUN- 


2002; 


2002US- 


0386864P. 


PR 


06- 


JUN- 


2002; 


2002US- 


0387016P. 


PR 


07- 


JUN- 


2002; 


2002US- 


0386796P. 


PR 


07- 


JUN- 


•2002; 


2002US- 


0386816P. 


PR 


07- 


JUN- 


•2002; 


2002US- 


0386931P. 


PR 


07- 


JUN- 


-2002; 


2002US- 


0386942P. 


PR 


07- 


JUN- 


•2002; 


2002US- 


0386971P. 


PR 


07- 


JUN- 


•2002; 


2002US- 


0387262P. 


PR 


08- 


JUN- 


■2002; 


2002US- 


0296960P. 


PR 


10- 


JUN- 


•2002; 


2002US- 


0387400P. 



PR 


10- 


-JUN- 


2002, 


2002US- 


0387535P. 


PR 


11- 


-JUN- 


2002, 


2002US- 


0387610P. 


PR 


11- 


-JUN- 


2002, 


• 2002US- 


0387625P. 


PR 


11- 


-JUN- 


2002, 


2002US- 


0387634P. 


PR 


11- 


-JUN- 


2002, 


■ 2002US- 


0387668P. 


PR 


11- 


-JUN- 


2002, 


2002US- 


0387696P. 


PR 


11- 


-JUN- 


2002, 


2002US- 


0387702P. 


PR 


11- 


-JUN- 


2002, 


2002US- 


0387836P. 


PR 


11- 


-JUN- 


2002, 


2002US- 


0387859P. 


PR 


12- 


-JUN- 


2002, 


2002US- 


0387933P. 


PR 


12- 


-JUN- 


2002, 


• 2002US- 


0387934P. 


PR 


12- 


-JUN- 


2002, 


2002US- 


0387960P. 


PR 


12- 


-JUN- 


2002, 


2002US- 


0388022P. 


PR 


12- 


-JUN- 


2002, 


2002US- 


0388096P. 


PR 


13- 


-JUN- 


2002, 


2002US- 


0389123P. 


PR 


14- 


-JUN- 


2002, 


2002US- 


0389118P. 


PR 


14- 


-JUN- 
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PR 05-NOV-2002; 2002US-00423798 . 

PR 05-NOV-2002; 2002US-0423798P . 

PR 12-NOV-2002; 2002US-0425453P . 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ; 

PI Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR, Ellerman K; 

PI Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X; 

PI Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR; 

PI Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS; 

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L; 

PI Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA; 

PI Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M; 

PI Zhong H; 

XX 

DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71617. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 
XX 

PS Example 21; SEQ ID NO 514; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 898 AA; 

Query Match 100.0%; Score 559; DB 8; Length 898; 
Best Local Similarity 100.0%; Pred. No. 3e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I II I I I I I I I I I I I I I II I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 495 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAIPRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I II I III I III III I I I I I I II I I I I I II III II III I I I I I I 

Db 555 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 10 



ADH71626 

ID ADH71626 standard; protein; 898 AA. 
XX 

AC ADH71626; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention N0V21i SEQ ID NO: 522. 
XX 

KW human; cytostatic; immunomodulator ; neuroprotective; nootropic; 

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia . 

XX 

OS Homo sapiens. 
XX 

PN WO2003102155-A2. 
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XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Alsobrook JP, Alvarez E, Anderson DW, Boldog Fh, Casman SJ; 

PI Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR, Ellerman K; 

PI Ettenberg S f GangollI EA, Gerlach VL, Gorman L, Giinther E, Guo X; 

PI Gusev VY f Herrmann JL f Ji W, Kekuda R, Li L, Liu X, Macdougall JR; 

PI Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS; 

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L; 

PI Rieger DK, Rothenberg ME, Sciore P f Shenoy SG, Shimkets RA; 

PI Smithson G r Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M; 

PI Zhong H; 

XX 

DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71625. 



XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders , e.g. cancer, diabetes , infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmaco genomics . 
XX 

PS Example 21; SEQ ID NO 522; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulator , 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics . The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 898 AA; 



Query Match 100.0%; Score 559; DB 8; Length 898; 

Best Local Similarity 100.0%; Pred. No. 3e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 494 TSNMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 553 



Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I 
Db 554 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 597 



RESULT 11 
AAU79939 

ID AAU79939 standard; protein; 899 AA. 
XX 

AC AAU79939; 
XX 

DT 15-JUL-2002 (first entry) 
XX 

DE Human UNC5-like protein NOV1 . 
XX 

KW Human; NOVX polypeptide; cardiomyopathy; atherosclerosis; cancer; 

KW cell signal processing; metabolic pathway modulation; cancerous tissue; 

KW antibody; diabetes; transgenic animal; UNC5-like protein; NOV1; 

KW chromosome 13. 

XX 

OS Homo sapiens. 
XX 

PN WO200229038-A2. 
XX 



PD ll-APR-2002. 
XX 

PF 04-OCT-2001; 2001WO-US031377 . 
XX 

PR 04-OCT-2000; 2000US-0237862P . 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Herrmann JL, Rastelli L, Shimkets RA; 
XX 

DR WPI; 2002-340104/37. 

DR N-PSDB; ABK49422. 
XX 

PT Novel isolated NOVX polypeptide, and encoded polynucleotide, useful for 

PT treating cardiomyopathy, artherosclerosis, and cancer. 

XX 

PS Claim 1; Page 9; 180pp; English. 
XX 

CC The present invention relates to a new NOVX polypeptide having a 900 

CC (NOV1), 4349 (NOV2), 940 (NOV3) , 798 (NOV4), 865 (NOV5) , or 331 (NOV6) 

CC residue amino acid sequence, as given in the specification. The novel 

CC polypeptide, and its encoding polynucleotide, are used to treat 

CC cardiomyopathy, atherosclerosis, cancer or a disease related to cell 

CC signal processing and metabolic pathway modulation, in a human. Detecting 

CC the polypeptide or polynucleotide is useful for identifying cancerous 

CC tissue. The antibody can be used to treat diabetes or cancer. The host 

CC cells can be used to produce non-human transgenic animals useful in drug 

CC screening. The present amino acid sequence is that of the human UNC5-like 

CC protein NOV1 of the invention. This sequence is encoded by the human UNC5 

CC -like NOV1 gene located on chromosome 13 

XX 

SQ Sequence 899 AA; 

Query Match 100.0%; Score 559; DB 5; Length 899; 
Best Local Similarity 100.0%; Pred. No. 3e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSN^ITYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 495 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 555 LS PI VSCGPPGVLLTRPVI LAMDHCGEPS PDSWSLRLKKQSCEG 598 



RESULT 12 
ADG42569 

ID ADG42569 standard; protein; 899 AA. 
XX 

AC ADG42569; 
XX 

DT 26-FEB-2004 (first entry) 
XX 

DE Novel human NOV1 . 
XX 

KW cytostatic; gene therapy; NOVX-agonist; NOVX-antagonist ; pharmaceutical; 



KW NOVX-associated disorder; cancer; human. 
XX 

OS Homo sapiens. 
XX 

PN US2003204052-A1. 
XX 

PD 30-OCT-2003. 
XX 

PF 04-OCT-2001; 2001US-00970944 . 
XX 

PR 04-OCT-2000; 2000US-0237862P. 
XX 

PA (HERR/) HERRMANN J L. 

PA (RAST/) RASTELLI L. 

PA (SHIM/) SHIMKETS R A. 
XX 

PI Herrmann JL, Rastelli L, Shimkets RA; 
XX 

DR WPI; 2003-900673/82. 

DR N-PSDB; ADG42568. 
XX 

PT New NOVX gene or NOVX-specif ic antibody, useful for preparing a 

PT composition for treating or preventing a NOVX-associated disorder , e.g., 

PT cancer. 

XX 

PS Claim 1; SEQ ID NO 2; 118pp; English. 
XX 

CC The invention describes a new isolated polypeptide comprising: a 

CC polypeptide or its mature form comprising a sequence not given in the 

CC specification; or a variant of (A) , where one or more amino acid residues 

CC in the variant differs in no more than 15% from the amino acid sequence 

CC of the mature form. The pharmaceutical composition may be administered 

CC via oral, transdermal, rectal or parenteral route. The polypeptide, 

CC nucleic acid or antibody is useful for preparing a composition for 

CC treating or preventing a NOVX-associated disorder, e.g., cancer. This is 

CC the amino acid sequence of a human NOVX protein. 

XX 

SQ Sequence 899 AA; 



Query Match 100.0%; Score 559; DB 7; Length 899; 

Best Local Similarity 100.0%; Pred. No. 3e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 495 TSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LS PIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II 
Db 555 LS PIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 13 
ADH71636 

ID ADH71636 standard; protein; 899 AA. 
XX 

AC ADH71636; 



XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention N0V21n SEQ ID NO: 532. 
XX 

KW human; cytostatic; immunomoduiator ; neuroprotective; nootropic; 

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia . 

XX 

OS Homo sapiens. 
XX 

PN WO2003102155-A2. 
XX 
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DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71635. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 



XX 

PS Example 21; SEQ ID NO 532; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic,, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics . The present sequence 

CC represents a NOVX polypeptide of the invention. 
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SQ Sequence 899 AA; 

Query Match 100.0%; Score 559; DB 8; Length 899; 

Best Local Similarity 100.0%; Pred. No. 3e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 
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ID ADH71642 standard; protein; 899 AA. 
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DT 25-MAR-2004 (first entry) 
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DE Human protein of the invention NOV21q SEQ ID NO: 538. 
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KW human; cytostatic; immunomodulator ; neuroprotective; nootropic; 

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 
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Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ; 
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Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X; 
Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR; 
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WPI; 2004-081935/08. 
N-PSDB; ADH71641. 

New NOVX polypeptides and nucleic acid molecules useful for preventing or 
treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 
obesity, and in chromosome mapping, tissue typing or phafmacogenomics . 

Example 21; SEQ ID NO 538; 1880pp; English. 

The invention relates to a novel isolated polypeptide (NOVX) . A 
polypeptide of the invention has cytostatic, immunomodulator , 
neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 
antilipaemic activity, and may have a use in gene therapy, and as a 
vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 
any of the 303 fully defined nucleotide sequences given in the 
specification. The polypeptide is useful in the manufacture of a 
medicament for treating a syndrome associated with a human disease. The 
polypeptide, polynucleotide and antibody are useful in diagnosing, 
treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 
Alzheimer f s disease, Parkinson^ disease, obesity, diabetes, infectious 
diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 
further used as hybridisation probes, in chromosome mapping, tissue 
typing, preventive medicine, and pharmacogenomics . The present sequence 
represents a NOVX polypeptide of the invention. 
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SQ Sequence 899 AA; 



Query Match 100.0%; Score 559; DB 8; Length 899; 

Best Local Similarity 100.0%; Pred. No. 3e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I II I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I 1 I I I I I I II I I I I I 
Db 495 TSNMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LSPIVSCGPPGVLLTRPVTLAMDHCGEPSPDSWSLRLKKQSCEG 104 
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PR 05-NOV-2002; 2002US-0423798P . 

PR 12-NOV-2002; 2002US-0425453P . 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ; 

PI Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR, Ellerman K; 

PI Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X; 

PI Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR; 

PI Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS; 

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L; 

PI Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA; 

PI Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M; 

PI Zhong H; 

XX 

DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71647. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 
XX 

PS Example 21; SEQ ID NO 544; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulator , 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 

Query Match 100.0%; Score 559; DB 8; Length 899; 
Best Local Similarity 100.0%; Pred. No. 3e-57; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 495 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LSPIVSCGPPGVLLTRPVT LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 555 LSPIVSCGPPGVLLTRPVT LAMDHCGEPSPDSWSLRLKKQSCEG 598 



Search completed: March 1, 2005, 08:56:49 
Job time : 24.793 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



March 1, 2005, 08:42:47 



Search time 6.2315 Seconds 

(without alignments) 

1245.848 Million cell updates/sec 



US-10-624-932-2_COPY_495_598 
559 

1 TSNMTYGTFNFLGGRLMIPN 



. CGEPSPDSWSLRLKKQSCEG 104 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 513545 seqs, 74649064 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing : 



513545 



Database : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/ l/iaa/5A_COMB . pep : * 

2 : / cgn2_6/ptodata/ l/iaa/5B_COMB . pep : * 

3: /cgn2_6/ptodata/l/iaa/6A_COMB.pep: * 

4 : /cgn2_6/ptodata/ 1/iaa/ 6B_COMB . pep : * 

5: /cgn2_6/ptodata/l/iaa/PCTUS_COMB.pep: * 

6: / cgn2_6/ptodata/l/iaa/backfilesl .pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
US-08-808-982-6 

Sequence 6, Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo, E. David 
APPLICANT: Hink, Lindsay 
APPLICANT: Masu, Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 



; ZIP: 94104 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/808,982 
FILING DATE: 
CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
; INFORMATION FOR SEQ ID NO: 6: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 557 amino acids 

; TYPE: amino acid 

STRANDEDNESS: not relevant 
; TOPOLOGY: not relevant 

MOLECULE TYPE: peptide 
US-08-808-982-6 



Query Match 98.9%; Score 553; DB 2; Length 557; 

Best Local Similarity 99.0%; Pred. No. 1.2e-60; 

Matches 103; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 153 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 212 



Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 213 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLALKKQSCEG 256 



RESULT 2 

US-09-306-902A-6 

Sequence 6, Application US/09306902A 
Patent No. 6277585 

GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
Leonardo, E. David 
Hink, Lindsay 
Masu, Masayuki 
Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 



ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/306, 902A 

FILING DATE: 07-May-1999 

CLASSIFICATION: <Unknown> 
ATTORNEY/AGENT INFORMATION: 

NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 

REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 557 amino acids 

; TYPE: amino acid 

STRANDEDNESS: not relevant 
; TOPOLOGY: not relevant 

MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
US-09-306-902A-6 



Query Match - 98.9%; Score 553; DB 3; Length 557; 

Best Local Similarity 99.0%; Pred. No. 1.2e-60; 

Matches 103; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I i I I I I I I I I I I 
Db 153 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 212 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I II I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 213 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLALKKQSCEG 256 



RESULT 3 
US-08-808-982-5 

Sequence 5, Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo, E. David 
APPLICANT: Hink, Lindsay 
APPLICANT: Masu, Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 



COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/808,982 
FILING DATE: 
CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 898 amino acids 
TYPE: amino acid 
STRANDEDNESS : not relevant 
TOPOLOGY: not relevant 
MOLECULE TYPE: peptide 
US-08-808-982-5 

Query Match 98.9%; Score 553; DB 2; Length 898; 

Best Local Similarity 98.1%; Pred. No. 2.2e-60; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 495 TSNMAYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LSPIVSCGPPGVLLTRPVT LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 555 LSPWSCGPPGVLLTRPVT LAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 4 

US-09-306-902A-5 

; Sequence 5, Application US/09306902A 
; Patent No. 6277585 

GENERAL INFORMATION: 
; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo, E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 



COUNTRY: USA 

ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.30 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/306, 902A 

FILING DATE: 07-May-1999 

CLASSIFICATION: <Unknown> 
ATTORNEY/AGENT INFORMATION: 

NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36, 627 

REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 898 amino acids 

TYPE: amino acid 
; STRANDEDNESS : not relevant 

; TOPOLOGY: not relevant 

MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-306-902A-5 



Query Match 98.9%; Score 553; DB 3; Length 898; 

Best Local Similarity 98.1%; Pred. No. 2.2e-60; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I II I I I I I I I II I I I I II I I I I 
Db 495 TSNMAYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I : I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I. 
Db 555 LSPWSCGPPGVLLTRPVTLAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 5 

US-09-949-016-10665 

; Sequence 10665, Application US/09949016 
; Patent No. 6812339 
; GENERAL INFORMATION: 

; APPLICANT: VENTER, J. Craig et al . 

; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

; TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES 

THEREOF 

; FILE REFERENCE: CL001307 

; CURRENT APPLICATION NUMBER: US/09/949,016 

; CURRENT. FILING DATE: 2000-04-14 

; PRIOR APPLICATION NUMBER: 60/241,755 

; PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 60/237,768 

; PRIOR FILING DATE: 2000-10-03 



PRIOR APPLICATION NUMBER: 60/231,498 
PRIOR FILING DATE: 2000-09-08 
NUMBER OF SEQ ID NOS : 207012 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 10665 
LENGTH: 769 
TYPE: PRT 
ORGANISM: Human 
US-09-949-016-10665 

Query Match 60.3%; Score 337; DB 4; Length 769; 

Best Local Similarity 55.6%; Pred. No. 2.1e-33; 

Matches 55; Conservative 24; Mismatches 20; Indels 0; Gaps 0; 

Qy 6 YGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIV 65 

: I : I I III I : : I I : I : I I I I I I I I : I : : I I : I : I : I : I : I I : I I I I : I : I 
Db 371 FGSFNSLGGHLIVPNSGVSLLIPAGAIPQGRVYEMYVTVHRKETMRPPMDDSQTLLTPW 430 

Qy 66 SCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I : I I II : I : : I : II I : : I 
Db 431 SCGPPGALLTRPWLTMHHCADPNTEDWKILLKNQAAQG 4 69 



RESULT 6 
US-08-808-982-7 

Sequence 7, Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo, E. David 
APPLICANT: Hink, Lindsay 
APPLICANT: Masu, Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/808,982 
FILING DATE: 
CLASSIFICATION: 530 
ATTORNEY/ AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 



TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 943 amino acids 
TYPE: amino acid 
STRANDEDNESS: not relevant 
TOPOLOGY: not relevant 
MOLECULE TYPE: peptide 
US-08-808-982-7 

Query Match 49.6%; Score 277; DB 2; Length 943; 

Best Local Similarity 52.9%; Pred. No. 9.1e-26; 

Matches 55; Conservative 17; Mismatches 30; Indels 2; Gaps 2; 

Qy 2 SNMTYGTFNFLGGRLMIPNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLA-GCQTL 60 

I : III I I I I I I I I I : I I I : I I I I : I I I : : I I : : I I I I I : I I I : 
Db 540 SSSVSGTFGCLGGRLTIPGTGVSLLVPNGAIPQGKFYDLYLRINKTEST-LPLSEGSQTV 598 

Qy 61 LSPIVSCGPPGVLLTRPVTLAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I : I I I I : I I I I.I : I : I I I I : I I I : : I 

Db 599 LS PS VTCGPTGLLLCRPWLTVPHCAE VTAGDWI FQLKTQAHQG 642 

RESULT 7 

US-09-306-902A-7 

Sequence 7 , Application US/09306902A 
Patent No. 6277585 

GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
Leonardo , E. David 
Hink, Lindsay 
Masu, Masayuki 
Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/306, 902A 
FILING DATE: 07-May-1999 
CLASSIFICATION: <Unknown> 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/DOCKET* NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 



TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 7: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 943 amino acids 

; TYPE: amino acid 

STRANDEDNESS : not relevant 
; TOPOLOGY: not relevant 

; MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
US-09-306-902A-7 



Query Match 49.6%; Score 277; DB 3; Length 943; 

Best Local Similarity 52.9%; Pred. No. 9. le-26; 

Matches 55; Conservative 17; Mismatches 30; Indels 2; Gaps 2; 



Qy 2 SNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLA-GCQTL 60 

I : I I I I I I I I I I I I : I I I : I I I I : I I I : : I I : : I I I I I : I I I : 
Db , 540 SSSVSGTFGCLGGRLTIPGTGVSLLVPNGAIPQGKFYDLYLRINKTEST-LPLSEGSQTV 598 

Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I : I I I I : I I I I I : I : I I I I : I I I : : I 

Db 599 LS PSVTCGPTGLLLCRPWLTVPHCAEVIAGDWI FQLKTQAHQG 642 



RESULT 8 

US-09-969-532-32 

; Sequence 32, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 
Encoding the Same 

FILE REFERENCE: LEX-0244-USA 
; CURRENT APPLICATION NUMBER: US/ 09/969, 532 
; CURRENT FILING DATE: 2001-10-02 
; PRIOR APPLICATION NUMBER: US 60/237,280 
; PRIOR FILING DATE: 2000-10-02 
; NUMBER OF SEQ ID NOS : 33 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 32 

LENGTH: 655 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-32 

Query Match 43.6%; Score 243.5; DB 4; Length 655; 

Best Local Similarity 45.2%; Pred. No. 8.9e-22; 

Matches 47; Conservative 22; Mismatches 34; Indels 1; Gaps 1; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I III I I I I I : : I I I I : I I I I I III : | I I : :: : : I I I : I 
Db 242 TELRTTGVFGHLGGRLVMPNTGVSLLI PHGAI PEENSWEI YMSINQGEP- S LQS DGS EVL 300 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I : I I I I : : : I I I : I I : I : I : : I I I : : : I 
Db 301 LSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRTQQG 344 



RESULT 9 

US-09-969-532-30 

; Sequence 30, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 

Encoding the Same 

; FILE REFERENCE: LEX-0244-USA 

; CURRENT APPLICATION NUMBER: US/09/969,532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280. 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS : 33 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 30 

LENGTH: 666 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-30 

Query Match 43.6%; Score 243.5; DB 4; Length 666; 

Best Local Similarity 45.2%; Pred. No. 9.1e-22; 

Matches 47; Conservative 22; Mismatches 34; Indels 1; Gaps 1; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I III I I I I I :: I I I I : I I I I I III : I I I : : : : : ! I I : I 
Db 253 TELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVL 311 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I : I I I I : : : I I I : I I : I : I : : I I I : : : I 
Db 312 LSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRTQQG 355 



RESULT 10 
US-09-969-532-28 

; Sequence 28, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 
Encoding the Same 

; FILE REFERENCE: LEX-0244-USA 

; CURRENT APPLICATION NUMBER: US/09/969,532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS: 33 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 28 

LENGTH: 669 

TYPE: PRT 
; ORGANISM: homo sapiens 



US-09-969-532-28 



Query Match 43.6%; Score 243.5; DB 4; Length 669; 

Best Local Similarity 45.2%; Pred. No. 9.2e-22; 

Matches 47; Conservative 22; Mismatches 34; Indels 1; Gaps 1; 



Qy 1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I III I I I I I : : I I I I : I I I I I III : I II : : : : : I I I : I 
Db 256 TELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVL 314 

Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I: I I I I ::: I I I : I I : I : I : : I I I : : : I 
Db 315 LSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRTQQG 358 



RESULT 11 
US-09-969-532-26 

; Sequence 26, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 
Encoding the Same 

; FILE REFERENCE: LEX-0244-USA 

; CURRENT APPLICATION NUMBER: US/09/969,532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS : 33 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 26 

LENGTH: 680 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-26 

Query Match 43.6%; Score 243.5; DB 4; Length 680; 

Best Local Similarity 45.2%; Pred. No. 9.4e-22; 

Matches 47; Conservative 22; Mismatches 34; Indels 1; Gaps 1; 



Qy 1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I III I I I I I :: I I I I : I I I I I III : I I I : : : : : I I I : I 

Db 267 TELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVL 325 

Qy 61 LSPIVSCGPPGVLLTRPVT LAMDHCGEPSPDSWSLRLKKQSCEG 104 

II I I : I I I I ::: I I I : I I : I : I : : I I I : : : I 

Db 326 LSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRTQQG 369 



RESULT 12- 
US-09-969-532-16 

; Sequence 16, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 



; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 
Encoding the Same 

; FILE REFERENCE: LEX-0244-USA 

; CURRENT APPLICATION NUMBER: US/09/969,532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS : 33 

; SOFTWARE: Fast SEQ for Windows Version 4.0 
; SEQ ID NO 16 

LENGTH: 886 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-16 

Query Match 43.6%; Score 243.5; DB 4; Length 886; 

Best Local Similarity 45.2%; Pred. No. 1.3e-21; 

Matches 47; Conservative 22; Mismatches 34; Indels 1; Gaps 1; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I III I I I II : : I I I I : I I I I I III : I I I : : : : : I I I : I 
Db 473 TELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVL 531 

Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I : II I I : : : I I I : I I : I : I : : I I I : : : I 
Db 532 LSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRTQQG 575 



RESULT 13 
US-09-969-532-14 

; Sequence 14, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 
Encoding the Same 

; FILE REFERENCE: LEX-0244-USA 

; CURRENT APPLICATION NUMBER: US/09/969,532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS: 33 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 14 
; LENGTH: 897 
; TYPE: PRT 

; ORGANISM: homo sapiens 
US-09-969-532-14 

Query Match 43.6%; Score 243.5; DB 4; Length 897; 

Best Local Similarity 45.2%; Pred. No. 1.3e-21; 

Matches 47; Conservative 22; Mismatches 34; Indels 1; Gaps 1; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I III I I I I I :: I I I I : I II I I III : I | I : : : : : I | I : I 
Db 484 TELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVL 542 



Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I : I I I I : : : I I I : I I : I : I : : I I I : : : I 
Db 543 LSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRTQQG 586 



RESULT 14 
US-09-969-532-12 

; Sequence 12, Application US/09969532 

; Patent No. 6777232 

; . GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 
Encoding the Same 

; FILE REFERENCE: LEX- 02 4 4 -USA 

; CURRENT APPLICATION NUMBER: US/09/969,532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS : 33 

; SOFTWARE: Fast SEQ for Windows Version 4.0 
; SEQ ID NO 12 

LENGTH: 900 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-12 

Query Match 43.6%; Score 243.5; DB 4; Length 900; 

Best Local Similarity 45.2%; Pred. No. 1.4e-21; 

Matches 47; Conservative 22; Mismatches 34; Indels 1; Gaps 1; 
Qy 1 TSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 



Db 487 TELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVL 545 



Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I : I I I I : : : I I I : I I : I : I : : I I I : : : I 
Db 546 LS PEVTCGP PDMI VTTPFALT I PHCADVS SEHWNI HLKKRTQQG 589 



RESULT 15 
US-09-969-532-10 

; Sequence 10, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 
Encoding the Same 

; FILE REFERENCE: LEX- 024 4 -USA 

; CURRENT APPLICATION NUMBER: US/09/969,532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS: 33 

; SOFTWARE: FastSEQ for Windows Version 4.0 



; SEQ ID NO 10 

LENGTH: 911 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-10 

Query Match 43.6%; Score 243.5; DB 4; Length 911; 

Best Local Similarity 45.2%; Pred. No. 1.4e-21; 

Matches 47; Conservative 22; Mismatches 34; Indels 1; Gaps 1; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I III I I I I I :: I I I I : I I I I I III : I I I : : : : : I I I : I 
Db 498 TELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVL 556 

Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I : I I I I : : : I I I : I I : I : ! : : I I I : : : I 
Db 557 LSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRTQQG 600 



Search completed: March 1, 2005, 09:05:53 
Job time : 7.2315 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



March 1, 2005, 08:46:18 ; Search time 4.05992 Seconds 

(without alignments) 
2464.715 Million cell updates/sec 

US-10-624-932-2_COPY_4 95_598 
559 

1 TSNMTYGTFNFLGGRLMIPN CGEPSPDSWSLRLKKQSCEG 104 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



283416 seqs, 96216763 residues 



Total number of hits satisfying chosen parameters: 



283416 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



PIR_79:* 
pirl : * 
pir2:* 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID. Description 



1 


163.5 


29.2 


1736 


2 


A47747 


tight junction pro 


2 


162.5 


29.1 


1745 


2 


A46431 


tight junction-ass 


3 


96 


17.2 


919 


2 


T32541 


unc-5 protein - Ca 


4 


96 


17.2 


947 


1 


B44294 


unc-5 protein, Ion 


5 


76 


13.6 


346 


2 


S21265 


acetylserotonin 0- 


6 


73 


13.1 


1848 


2 


S37771 


ankyrin, erythrocy 


7 


73 


13.1 


1862 


2 


149502 


ankyrin - mouse 


8 


72 


12.9 


1856 


2 


B35049 


ankyrin 1, erythro 


9 


72 


12.9 


1880 


2 


A35049 


ankyrin 1, erythro 


10 


72 


12.9 


1881 


1 


SJHUK 


ankyrin 1, erythro 


11 


70 


12.5 


384 


2 


S73122 


carbamoyl phosphat 


12 


69.5 


12.4 


369 


2 


D72240 


tRNA guanine trans 


13 


69 


12.3 


345 


2 


148780 


Stral/Eplg2 protei 



14 


69 


12. 


3 


345 


2 


158406 


LERK-2 - rat 


15 


68 


12. 


2 


346 


2 


S46993 


elk ligand - human 


16 


68 


12. 


2 


388 


1 


VCVGAC 


coat protein - art 


17 


68 


12. 


2 


803 


2 


S76106 


hypothetical prote 


18 


68 


12. 


2 


999 


2 


AG2413 


hypothetical prote 


19 


66.5 


11. 


9 


285 


2 


B83883 


sugar transport sy 


20 


66.5 


11. 


9 


544 


2 


C95854 


probable trehalose 


21 


66 


11. 


8 


201 


2 


D82590 


hypothetical prote 


22 


66 


11. 


8 


467 


2 


A60667 


cysteine proteinas 


23 


65. 5 


11. 


7 


217 


2 


T47175 


hypothetical prote 


24 


65.5 


11. 


7 


235 


2 


G72484 


hypothetical prote 


25 


65.5 


11. 


7 


411 


2 


A55610 


corticotropin^rele 


26 


65 


11. 


6 


388 


2 


S24926 


coat protein, 41K 


27 


64.5 


11. 


5 


467 


2 


A45629 


cysteine proteinas 


28 


64 


11. 


4 


579 


2 


G75142 


abc transporter, A 


29 


63.5 


11. 


4 


256 


2 


S74928 


hypothetical prote 


30 


63.5 


11. 


4 


400 


2 


D83892 


transcription regu 


31 


62.5 


11. 


2 


236 


2 


T40833 


haloacid dehalogen 


32 


62.5 


11. 


2 


255 


2 


AD1946 


hypothetical prote 


33 


62.5 


11. 


2 


260 


2 


G70568 


hypothetical prote 


34 


62.5 


11. 


2 


363 


2 


B97199 


uncharacterized co 


35 


62.5 


11. 


2 


498 


1 


VHIVA6 


nucleoprotein - in 


36 


62 


11. 


1 


253 


2 


G72598 


probable ABC trans 


37 


62 


11. 


1 


270 


2 


B83387 


hypothetical prote 


38 


61. 5 


11. 


0 


170 


2 


AI3322 


hypothetical prote 


39 


61.5 


11. 


0 


434 


1 


DEEBHT 


histidinol dehydro 


40 


61.5 


11. 


0 


445 


2 


D81716 


hypothetical prote 


41 


61.5 


11. 


0 


710 


2 


D72091 


type III secretion 


42 


61.5 


11. 


0 


710 


2 


C86531 


low calcium respon 


43 


61.5 


11. 


0 


1022 


2 


T49683 


probable oxoglutar 


44 


61.5 


11. 


0 


1396 


2 


T10627 


hypothetical prote 


45 


61 


10. 


9 


99 


2 


S42360 


capsicein - Phytop 



ALIGNMENTS 



RESULT 1 
A47747 

tight junction protein ZO-1 - human 
C; Species: Homo sapiens (man) 

C;Date: 02-Jun-1994 #sequence_revision 02-Jun-1994 #text_change 09-Jul-2004 
C; Accession: A47747 • 

R;Willott, E. ; Balda, M.S.; Fanning, A.S.; Jameson, B. ; Van Itallie, C. ; 
Anderson, J.M. 

Proc. Natl. Acad. Sci. U.S.A. 90, 7834-7838, 1993 

A;Title: The tight junction protein ZO-1 is homologous to the Drosophila discs- 
large tumor suppressor protein of septate junctions. 
A; Reference number: A47747; MUID: 93361541 ; PMID: 8395056 
A;Accession: A47747 
A; Status : preliminary 
A; Molecule type: mRNA 
A; Residues: 1-1736 <WIL> 

A; Cross-references: UNIPROT : Q07157 ; GB:L14837; NID:g292937; PIDN: AAA02891 . 1; 
PID:g292938 

C; Keywords: alternative splicing; membrane protein; phosphoprotein 
F; 15-94 /Domain : GLGF domain homology <GLG1> 



F;181-248/Domain: GLGF domain homology <GLG2> 
F;416-486/Domain: GLGF domain homology <GLG3> 
F; 633-7 82/ Domain : guanylate kinase homology <GKI> 

Query Match 29.2%; Score 163.5; DB 2; Length 1736; 

Best Local Similarity 41.6%; Pred. No. 3.6e-09; 

Matches 37; Conservative 12; Mismatches 35; Indels 5; Gaps 2; 



Qy 7 GTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPL— AGCQTLLSPI 64 

III III I I : I : : I I I I I I III : : : II : I I I I I : 

Db 1626 GIFNSNGGVLSSIETGVSIIIPQGAIPEGVEQEIYFKVCRDNSILPPLDKEKGETLLSPL 1685 

Qy 65 VSCGPPGVLLTRPVILAMDHCGEPSPDSW 93 

Mill: : I I I : I I I : I 

Db 1686 VMCGPHGLKFLKPVELRLPHC DPKTW 1711 



RESULT 2 
A46431 

tight junction-associated protein ZO-1 - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 21-Sep-1993 #sequence_revision 18-Nov-1994 #text_change 09-Jul-2004 
C;Accession: A46431 

R;Itoh, M.; Nagafuchi, A.; Yonemura, S.; Kitani-Yasuda, T.; Tsukita, S.; 
Tsukita, S. 

J. Cell Biol. 121, 491-502, 1993 

A;Title: The 220-kD protein colocalizing with cadherins in non-epithelial cells 

is identical to ZO-1, a tight junction-associated protein in epithelial cells: 

cDNA cloning and immunoelectron microscopy. 

A; Reference number: A46431; MUID: 93252986; PMID: 8486731 

A; Access ion: A46431 

A; Status: preliminary 

A;Molecule type: nucleic acid 

A; Residues: 1-1745 <ITO> 

A; Cross-references: UNIPROT : P39447 ; GB:D14340; NID:g303709; PIDN : BAA03274 . 1; 

PID:dl003784; PID:g303710 

A; Experimental source: F9 cells 

A;Note: sequence extracted from NCBI backbone (NCBIN : 131200, NCBIP : 131201 ) 

F;27-106/Domain: GLGF domain homology <GLG1> 

F; 42 8-4 98 /Domain : GLGF domain homology <GLG3> 

F; 64 5-7 94 /Domain : guanylate kinase homology <GKI> 

Query Match 29.1%; Score 162.5; DB 2; Length 1745; 

Best Local Similarity 41.6%; Pred. No. 4.6e-09; 

Matches 37; Conservative 12; Mismatches 35; Indels 5; Gaps 2; 

Qy 7 GTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPL — AGCQTLLSPI 64 

Til III I I : I :: I I I I I I III : : : II : I I I I I : 

Db 1635 GIFNSNGGVLSSIETGVSIIIPQGAIPEGIEQEIYFKVCRDNSILPPLDKEKGETLLSPL 1694 

Qy 65 VSCGPPGVLLTRPVILAMDHCGEPSPDSW 93 

Mill: : I I I : I I I : I 
Db 1695 VMCGPHGLKFLKPVELRLPHC DPKTW 1720 



RESULT 3 
T32541 



unc-5 protein - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change 09-Jul-2004 
C; Accession: T32541 
R; Latreille, P. 

submitted to the EMBL Data Library, December 1997 

A; Description: The sequence of C. elegans cosmid B0273. 

A; Reference number: Z21187 

A;Accession: T32541 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-919 <LAT> 

A;Cross-references: UNIPROT : 044171; EMBL :AF03 6698 ; PIDN : AAB88355 . 1 ; 
GSPDB:GN00022; CESP : B0273 . 4a 

A; Experimental source: strain Bristol N2; clone B0273 
C; Genetics : 

A;Gene: unc-5; CESP : B0273 . 4a 
A;Map position: 4 

A;Introns: 41/3; 108/1; 142/3; 201/1; 323/2; 553/1; 858/3 

C; Superf amily : unc-5 protein; immunoglobulin homology; SH3 homology; 

thrombospondin type 1 repeat homology 

Query Match 17.2%; Score 96; DB 2; Length 919; 

Best Local Similarity 25.1%; Pred. No. 0.034; 

Matches 27; Conservative 21; Mismatches 47; Indels 10; Gaps 3; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

: I : : | | | : : | | : : | | : | : : | | : : | : : 

Db 500 SQNIV7VAQIDSNGARLSLSKSGARLIWELAVEGEKM — LYLAVSDTLTDQPHLKPI ESA 557 

Qy 61 LSPIVSCGPPGV LLTRPVI LAMDHCGEPSP-DSWSLRL 97 

Ml:: | | : | | M : : : | | | | : | | 

Db 558 LS PVI VI GQCDVSMS AHDNI LRRPVWS FRHCAST FPRDNWQFTL 602 



RESULT 4 
B44294 

unc-5 protein, long form - Caenorhabditis elegans 
N; Contains: unc-5 protein, short form 
C; Species: Caenorhabditis elegans 

C;Date: 30-Apr-1993 #sequence_revision 28-Jul-1995 #text_change 09-Jul-2004 
C;Accession: B44294; T32540; A44294 

R;Leung-Hagesteijn, C. ; Spence, A.M.; Stern, B.D.; Zhou, Y. ; Su, M.W. ; 
Hedgecock, E.M.; Culotti, J.G. 
Cell 71, 289-299, 1992 

A; Title: UNC-5, a transmembrane protein with immunoglobulin and thrombospondin 

type 1 domains, guides cell and pioneer axon migrations in C. elegans. 

A; Reference number: A44294; MUID: 93046629 ; PMID: 1384987 

A; Contents: variety Bergerac 

A;Accession: B44294 

A;Molecule type: DNA 

A; Residues: 1-947 <LEU> 

A;Cross-references: UNIPROT : 04 4 171; GB:S47168; NID:g258527; PIDN : AAB23867 . 1; 
PID:g258529 

A;Note: sequence extracted from NCBI backbone (NCBIN : 116668, NCBIN: 116670, 
NCBIN:116672, NCBIN : 116674 , NCBIN : 116676, NCBIN : 116678 f NCBIN : 116680, 
NCBIN:116682, NCBIN : 116685, NCBIP : 118648 ) 



A; Note: authors translated the codon CTA for residue 642 as Val; sequence shown 
follows the authors 1 translation 

A; Note: mRNA lacking the first exon is equally prevalent 
R;Latreille, P. 

submitted to the EMBL Data Library, December 1997 

A; Description: The sequence of C. elegans cosmid B0273. 

A; Reference number: Z21187 

A; Accession: T32540 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-947 <LAT> 

A;Cross-references: EMBL: AF036698; PIDN: AAB88356 . 1; GSPDB : GN00022 ; CESP : B0273 . 4b 
A; Experimental source: strain Bristol N2; clone B0273 
C; Genetics : 
A; Gene : unc-5 
A;Map position: 4 

A;Introns: 28/1; 69/3; 136/1; 170/3; 229/1; 351/2; 581/1; 886/3 
C; Function: 

A; Description: required for guidance of pioneering axons and cells migrating 
dorsally along the body wall; proposed to be a receptor on the surface of the 
motile cells 

C;Superf amily : unc-5 protein; immunoglobulin homology; SH3 homology; 
thrombospondin type 1 repeat homology 

C; Keywords: alternative splicing; duplication; glycoprotein; receptor; 
transmembrane protein 

F; 30-947/Product : unc-5 protein, short form #status predicted <ALT> 
F; 4 6- 116/ Domain : immunoglobulin homology <IM1> 
F; 153-211/Domain: immunoglobulin homology <IM2> 

F;229-300/Domain: thrombospondin type 1 repeat homology #status atypical <THR1> 
F; 3 01- 35 4 /Domain : thrombospondin type 1 repeat homology <THR2> 
F; 365-390/Domain: transmembrane #status predicted <TMM> 
F;512-559/Domain: SH3 homology <SH3> 

F; 53-114, 65-112, 160-209/Disulf ide bonds: #status predicted 
F;206/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 17.2%; Score 96; DB 1; Length 947; 

Best Local Similarity 25.7%; Pred. No. 0.035; 

Matches 27; Conservative 21; Mismatches 47; Indels 10; Gaps 3; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

: I : : | | | : : | | : : | | : | : : | | : : | : : 

Db 528 SQNIVAAQIDSNGARLSLSKSGARLIVPELAVEGEKM — LYLAVSDTLTDQPHLKPIESA 585 

Qy 61 LSPIVSCGPPGV LLTRPVI LAMDHCGEPSP-DSWSLRL 97 

III:: I I : I III::: II I hi I 

Db 586 LSPVIVIGQCDVSMSAHDNILRRPVWSFRHCASTFPRDNWQFTL 630 



RESULT 5 
S21265 

acetylserotonin O-methyltransf erase (EC 2.1.1.4) - chicken 
N; Alternate names: hydroxyindole O-methyltransf erase 
C; Species: Gallus gallus (chicken) 

C;Date: 07-Apr-1994 #sequence_revision 07-Apr-1994 #text_change 09-Jul-2004 
C; Accession: S21265; S22154 

R;Voisin, P.; Guerlotte, J.; Bernard, M. ; Collin, J. P.; Cogne, M. 
Biochem. J. 282, 571-576, 1992 



A; Title: Molecular cloning and nucleotide sequence of a cDNA encoding 

hydroxyindole O-methyltransf erase from chicken pineal gland. 

A; Reference number: S21265; MUID : 92189600 ; PMID: 1372168 

A; Accession: S21265 

A; Molecule type: mRNA 

A; Residues: 1-34 6 <VOI> 

A; Cross-references: UNIPROT : Q92056; EMBL:X62309; NID:g62925; PIDN : CAA44 189 . 1 ; 
PID:g62926 

C ; Super family : O-methyltrans f erase 

C; Keywords: melatonin biosynthesis; methyltransf erase; S-adenosylmethionine 

Query Match 13.6%; Score 76; DB 2; Length 346; 

Best Local Similarity 23.8%; Pred. No. 1.6; 

Matches 30; Conservative 20; Mismatches 36; Indels 40; Gaps 6; 

Qy 13 GGRL MIPNTGISL LIPP DAIPRGKIYE 39 

ill : || ::: |:|| |:|| :| 

Db 191 GGALAQECVFLYPNCTVTIYDLPKWQVAKERLVPPEERRIAFHEGDFFKDSIPEADLYI 250 

Qy 4 0 I YLTLHKPEDVRLPLAGCQTLLS PIV- SCGPPGVXLTRPVI LAMDHCGEPS PDSWSLRLK 98 

: II : I : I : I I : : : I I I : I : I : I I : I I : 

Db 251 LSKILHDWDDKK CRQLLAEVYKACRPGGGVLLVESLLSEDRSGPVETQLYSLNML 305 

Qy 99 KQSCEG 104 

I : II 

Db 306 VQT-EG 310 



RESULT 6 
S37771 

ankyrin, erythrocyte - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 27-May-1994 #sequence_revision 03-Aug-1995 #text_change 09-Jul-2004 
C;Accession: S37771 

R;Birkenmeier, C.S.; White, R.A. ; Peters, L.L.; Hall, E.J.; Lux, S.E.; Barker, 
J.E. 

J. Biol. Chem. 268, 9533-9540, 1993 

A; Title: Complex patterns of sequence variation and multiple 5' and 3 1 ends are 

found among transcripts of the erythroid ankyrin gene. 

A; Reference number: S37771; MUID: 93252825 ; PMID: 8486643 

A; Accession: S37771 

A; Status: preliminary 

A; Molecule type: mRNA 

A; Residues: 1-1848 <BIR> 

A; Cross-references: UNIPROT : Q61302 ; EMBL:X69063; NID:g311816; PIDN : CAA4 8801 . 1 ; 
PID:g311817 

C; Super family : ankyrin; ankyrin repeat homology 
C; Keywords: alternative splicing 
F;48-80/Domain: ankyrin repeat homology <AN01> 
F; 8 1-1 13 /Domain : ankyrin repeat homology <AN02> 
F; 114-14 6/Domain: ankyrin repeat homology <AN03> 
F; 14 7-17 5 /Domain : ankyrin repeat homology <AN04> 
F; 176-2 08 /Domain : ankyrin repeat homology <AN05> 
F;209-241/Domain: ankyrin repeat homology <AN06> 
F;242-274/Doraain: ankyrin repeat homology <AN07> 
F;275-307/Domain: ankyrin repeat homology <AN08> 
F;308-340/Domain: ankyrin repeat homology <AN09> 



F;341 
F;374 
F;407 
F;440 
F;473 
F;506 
F;539 
F;572 
F;605 
F;638 
F;671 
F;704- 
F;737- 
F;770« 



-373/Domain: 
-406/Domain: 
-439/Domain: 
-472/Domain: 
-505/Domain: 
-53 8 /Domain: 
-571/Domain : 
-60 4 /Domain: 
-637/Domain : 
-670 /Domain: 
-703/Domain: 
-736/Domain : 
-769/Domain: 
-802/Domain: 



ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 



repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 



homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 



<AN10> 
<AN11> 
<AN12> 
<AN13> 
<AN14> 
<AN15> 
<AN16> 
<AN17> 
<AN18> 
<AN19> 
<AN20> 
<AN21> 
<AN22> 
<AN23> 



Query Match 13.1%; Score 73; DB 2; Length 1848; 

Best Local Similarity 33.3%; Pred. No. 22; 

Matches 23; Conservative 11; Mismatches 27; Indels 



8; Gaps 



3; 



Qy 

Db 



20 NTGISLLIPPD — AI PRGKI YEI YLTLHKPEDVRL — PLAGCQTLLSPIVSCGPPGVLLT 75 
: I : : : I I I II | III:: I I I : I I I : : I I I 

943 HNGLRWIPPRTCAAPT. RITCRLVKPQKLNTPPPLAEEEGLASRIIALGPTGAQFL 998 



Qy 



Db 



76 RPVT LAMDH 84 
III: : I 
999 SPVIVEIPH 1007 



RESULT 7 
149502 

ankyrin - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 09-Jul-2004 
C; Accession: 149502 

R; White, R.A. ; Birkenmeier, C.S.; Peters, L.L.; Barker, J.E.; Lux, S.E. 
Mamm. Genome 3, 281-285, 1992 

A; Title: Murine erythrocyte ankyrin cDNA: Highly conserved regions of the 
regulatory domain. 

A;Reference number: 149502; MUID: 92345717; PMID: 1386265 
A;Accession : 149502 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-1862 <RES> 

A;Cross-references: UNIPROT:Q02357; GB:M84756; NID:gl91939; PIDN : AAA37236 . 1; 
PID:gl91940 
C; Genetics : 
A; Gene: Ank-1 

C;Superfamily : ankyrin; ankyrin repeat homology 
C; Keywords: alternative splicing 
F; 40-72/Domain: ankyrin repeat homology <AN01> 
F;73-105/Domain: ankyrin repeat homology <AN02> 
F; 10 6- 13 8 /Domain : ankyrin repeat homology <AN03> 
F; 13 9- 167 /Domain : ankyrin repeat homology <AN04> 
F; 168-200/Domain: ankyrin repeat homology <AN05> 
F;201-233/Domain: ankyrin repeat homology <AN06> 
F;234-266/Domain: ankyrin repeat homology <AN07> 
F;267-299/Domain: ankyrin repeat homology <AN08> 



F;300- 
F;333- 
F;366- 
F;399- 
F;432- 
F;465- 
F;498- 
F;531- 
F;564- 
F;597- 
F;630- 
F;663- 
F;696- 
F;729- 
F;762- 



332/Domain : 
3 65 /Domain: 
398/Domain: 
431/Domain: 
464/Domain : 
497/Domain: 
530/Domain : 
563/Domain : 
596/Domain: 
62 9 /Domain: 
662/Domain: 
695/Domain: 
728/Domain: 
761/Domain: 
794/Domain: 



ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 



repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 



homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 



<AN09> 
<AN10> 
<AN11> 
<AN12> 
<AN13> 
<AN14> 
<AN15> 
<AN16> 
<AN17> 
<AN18> 
<AN19> 
<AN20> 
<AN21> 
<AN22> 
<AN23> 



Query Match 13.1%; Score 73; DB 2; Length 1862; 

Best Local Similarity 33.3%; Pred. No. 23; 

Matches 23; Conservative 11; Mismatches 27; Indels 8; 



Gaps 



3; 



Qy 

Db 



20 NTGISLLIPPD — AI PRGKI YEI YLTLHKPEDVRL — PLAGCQTLLSPIVSCGPPGVLLT 75 
: I : : : I I I II I III:: III : I I I : : I I I 

927 HNGLRWI PPRTCAAPT RITCRLVKPQKLNTPPPLAEEEGLASRIIALGPTGAQFL 982 



Qy 

Db 



76 RPVILAMDH 84 

III: : I 
983 SPVIVEIPH 991 



RESULT 8 
B35049 

ankyrin 1, erythrocyte splice form 3 - human 
N;Alternate names: ankyrin 2.1, erythrocyte; ankyrin-R 
N; Contains: ankyrin 2.2, erythrocyte 
C; Species: Homo sapiens (man) 

C;Date: 17-Nov-1995 #sequence_revision 17-Nov-1995 #text_change 10-Jul-1998 
C; Accession: B35049 

R;Lambert, S.; Yu, H.; Prchal, J.T.; Lawler, J.; Ruff, P.; Speicher, D. ; Cheung, 
M.C.; Kan, Y.W. ; Palek, J. 

Proc. Natl. Acad. Sci. U.S.A. 87, 1730-1734, 1990 

A; Title: cDNA sequence for human erythrocyte ankyrin. 

A; Reference number: A35049; MUID: 90175370; PMID: 1689849 

A; Accession: B35049 

A; Status : preliminary 

A; Molecule type: mRNA 

A; Residues: 1-1856 <LAM> 

C; Genetics : 

A; Gene: GDB : ANK1 ; ANK 

A;Cross-references: GDB: 118737; OMIM: 182900 
A; Map position: 8pll . 2-8pll . 2 

C; Superf amily : ankyrin; ankyrin repeat homology 
C; Keywords: alternative splicing 

F;2-1856/Product : ankyrin 1, erythrocyte form 3 #status predicted <MAT> 
F;2-1513, 1676-1856/Product: ankyrin 2.2, erythrocyte #status predicted <MA2> 
F; 4 4-7 6/ Domain: ankyrin repeat homology <AN01> 
F; 77-109/Domain: ankyrin repeat homology <AN02> 
F;110-142/Domain: ankyrin repeat homology <AN03> 



F;143- 
F;172- 
F;205- 
F;238- 
F;271- 
F;304- 
F;337- 
F;370- 
F;403- 
F;436- 
F;469- 
F;502- 
F;535- 
F;568- 
F;601- 
F;634- 
F;667- 
F;700- 
F;733- 
F;766- 



171/ Domain 
204/Domain 
237/Domain 
270/Domain 
303/Domain 
336/Domain 
369/Domain 
4 02 /Domain 
435/Domain 
4 68 /Domain 
501/Domain 
534/Domain 
567/Domain 
600/Domain 
633/Domain 
666/Domain 
699/Domain 
732/Domain 
765/Domain 
798/Domain 



ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 
ankyrin 



repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 
repeat 



homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 
homology 



<AN04> 
<AN05> 
<AN06> 
<AN07> 
<AN08> 
<AN09> 
<AN10> 
<AN11> 
<AN12> 
<AN13> 
<AN14> 
<AN15> 
<AN16> 
<AN17> 
<AN18> 
<AN19> 
<AN20> 
<AN21> 
<AN22> 
<AN23> 



Query Match 12.9%; Score 72; DB 2; Length 1856; 

Best Local Similarity 33.3%; Pred. No. 29; 

Matches 23; Conservative 11; Mismatches 27; Indels 



8 ; Gaps 



3; 



Qy 

Db 



20 NTGISLLIPPD — AIPRGKIYEIYLTLHKPEDVRL — PLAGCQTLLSPIVSCGPPGVLLT 75 
: I : : : I I I II I 111:: | | | : I I I : : I I I 

931 HNGLRWTPPRTCAAPT RITCRLVKPQKLSTPPPLAEEEGLASRIIALGPTGAQFL 986 



Qy 

Db 



76 RPVILAMDH 84 

III: : I 
987 SPVIVEIPH 995 



RESULT 9 
A35049 

ankyrin 1, erythrocyte splice form 2 - human 

N; Alternate names: ankyrin 2.1, erythrocyte; ankyrin-R 

N; Contains: ankyrin 2.2, erythrocyte 

C; Species: Homo sapiens (man) 

C;Date: 27-Jul-1990 #sequence_revision Ol-Oct-1992 #text_change 09-Jul-2004 
C;Accession: A35049 

R^-Lambert, S.; Yu, H. ; Prchal, J.T.; Lawler, J.; Ruff, P.; Speicher, D. ; Cheung, 
M.C.; Kan, Y.W. ; Palek, J. 

Proc. Natl. Acad. Sci. U.S.A. 87, 1730-1734, 1990 

A; Title: cDNA sequence for human erythrocyte ankyrin. 

A; Reference number: A35049; MUID : 90175370; PMID: 1689849 

A; Accession: A35049 

A; Status : preliminary 

A;Molecule type: mRNA 

A; Residues: 1-1880 <LAM> 

A; Cross-references: UNIPROT : P16157 ; GB:M28880 

C; Genetics : 

A; Gene: GDB : ANK1; ANK 

A; Cross-references: GDB: 118737; OMIM: 182900 
A; Map position: 8pll . 2-8pll . 2 

C; Super family : ankyrin; ankyrin repeat homology 



C; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 
F; 



Keywords: alternative splicing; cytoskeleton 

2-1880/Product: ankyrin 1, erythrocyte form 2 #status predicted <MAT> 
2-1513, 1676-1880/Product: ankyrin 2.2, erythrocyte #status predicted <MA2> 
44-76/Domain : ankyrin repeat homology <AN01> 
77-109/Domain: ankyrin repeat homology <AN02> 



110-142/Domain 
143-171/Domain 
172-204/Domain 
205-237/Domain 
238-270/Domain 
2 7 1-3 03/ Domain 
304-336/Domain 
337-369/Domain 
370-402/Domain 
403-435/Domain 
436-468/Domain 
469-501/Domain 
502-534/Domain 
53 5- 5 67 /Domain 
568-600/Domain 
601-633/Domain 
634-666/Domain 
667-699/Domain 
700-732/Domain 
733-765/Domain 
7 66- 7 9 8/ Domain 



ankyrin repeat homology <AN03> 
ankyrin repeat homology <AN04> 
ankyrin repeat homology <AN05> 
ankyrin repeat homology <AN06> 
ankyrin repeat homology <AN07> 
ankyrin repeat homology <AN08> 
ankyrin repeat homology <AN09> 
ankyrin repeat homology <AN10> 
ankyrin repeat homology <AN11> 
ankyrin repeat homology <AN12> 
ankyrin repeat homology <AN13> 
ankyrin repeat homology <AN14> 
ankyrin repeat homology <AN15> 
ankyrin repeat homology <AN16> 
ankyrin repeat homology <AN17> 
ankyrin repeat homology <AN18> 
ankyrin repeat homology <AN19> 
ankyrin repeat homology <AN20> 
ankyrin repeat homology <AN21> 
ankyrin repeat homology <AN22> 
ankyrin repeat homology <AN23> 



Query Match 12.9%; Score 72; DB 2; Length 1880; 

Best Local Similarity 33.3%; Pred. No. 29; 

Matches 23; Conservative 11; Mismatches 27; Indels 



8 ; Gaps 



3; 



Qy 

Db 

Qy 
Db 



20 NTGISLLIPPD — AIPRGKIYEIYLTLHKPEDVRL — PLAGCQTLLSPIVSCGPPGVLLT 75 
: I : : : II I II I III:: III : I I I : : I I t 

931 HNGLRWIPPRTCAAPT RITCRLVKPQKLSTPPPLAEEEGLASRIIALGPTGAQFL 986 

76 RPVTLAMDH 84 
III: : I 
987 SPVIVEIPH 995 



RESULT 10 
SJHUK 

ankyrin 1, erythrocyte splice form 1 - human 
N;Alternate names: ankyrin 2.1, erythrocyte; ankyrin-R 
N; Contains: ankyrin 2.2 
C; Species: Homo sapiens (man) 

C;Date: 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change 09-Jul-2004 
C;Accession: S08275; A33219; PC2220; A35443 
R;Lux, S.E.; John, K.M. ; Bennett, V. 
Nature 344, 36-42 f 1990 

A; Title: Analysis of cDNA for human erythrocyte ankyrin indicates a repeated 
structure with homology to tissue-differentiation and cell-cycle control 
proteins . 

A;Reference number: S08275; MUID: 90158830; PMID:2137557 
A;Accession: S08275 
A;Molecule type: mRNA 
A; Residues: 1-1881 <LU1> 



A; Cross-references: UNIPROT : P16157 ; EMBL:X16609; NID:g28701; PIDN : CAA34610 . 1 ; 

PID:g28702 

A; Accession: A33219 

A; Molecule type: protein 

A;Residues: 2-7, 'X* ,9-11, 'XM9-20, "I" ,22-30; 73 3- 749, 1 A 1 , 751-753; 828- 

833, 'X' ,835-855, 'X» , 857-859, 1 XX 1 , 862-871; 959-1003; 1106-1120, » XX », 1123-1128; 1149- 

1172; 1282-1285, 'E' , 1287-1288 ; 1307-1332 ; 1345-1365, 'X' , 1367 ; 1383-1427 ; 1601- 

1630;1686-1698, 1 D 1 , 1700 ; 1763-1772 <LUX> 

A; Note: 845-Arg and 1392-Thr were also found 

R;Hermann, J.; Barel, M.; Frade, R. 

Biochem. Biophys . Res. Commun. 204, 453-460, 1994 

A; Title: Human erythrocyte ankyrin, a cytoskeleton component, generates the p57 

membrane proteinase which cleaves C3, the third component of complement. 

A; Reference number: PC2220; MUID: 95071348 ; PMID: 7526850 

A; Acces s ion : PC2 220 

A; Molecule type: protein 

A; Residues: 910-929 <HER> 

R;Davis, L.H.; Bennett, V. 

J. Biol. Chem. 265, 10589-10596, 1990 

A;Title: Mapping the binding sites of human erythrocyte ankyrin for the anion 
exchanger and spectrin. 

A; Reference number: A35443; MUID: 90285190; PMID:2141335 
A; Accession: A35443 
A;Molecule type: protein 

A; Residues: f X',5, 'X' , 7-12; 403-417, , X I , 419-422, 'H',424, 1 LQ 1 ; 797-800, 'L',802- 

814;862-863, f X f ,865-877; 'X 1 , 899-901, , T , ,903-909, 'X 1 , 911-912 <DAV> 

C; Genetics : 

A; Gene: GDB:ANK1; ANK 

A; Cross-references: GDB: 118737; OMIM: 182900 
A; Map position: 8pll . 2-8pll . 2 

C; Superf amily : ankyrin; ankyrin repeat homology 
C; Keywords: alternative splicing; phosphoprotein 

F;2-1881/Product : ankyrin 1, erythrocyte form 1 #status predicted <MAT1> 
F; 2-1512, 1675-1881/Product: ankyrin 2.2, erythrocyte #status predicted <MAT2> 
F;2-827/Domain: 89K #status predicted <D0M1> 
F; 2-827/Region: anion exchange protein binding 
F; 44-76/Domain: ankyrin repeat homology <AN01> 
F; 77-109/Domain: ankyrin repeat homology <AN02> 
ankyrin repeat homology <AN03> 
ankyrin repeat homology <AN04> 
ankyrin repeat homology <AN05> 
ankyrin repeat homology <AN06> 
ankyrin repeat homology <AN07> 
ankyrin repeat homology <AN08> 
ankyrin repeat homology <AN09> 
ankyrin repeat homology <AN10> 
ankyrin repeat homology <AN11> 
ankyrin repeat homology <AN12> 
ankyrin repeat homology <AN13> 
ankyrin repeat homology <AN14> 
ankyrin repeat homology <AN15> 
ankyrin repeat homology <AN16> 
ankyrin repeat homology <AN17> 
ankyrin repeat homology <AN18> 
ankyrin repeat homology <AN19> 
ankyrin repeat homology <AN20> 
ankyrin repeat homology <AN21> 



F;110-142/Domain 
F;143-171/Domain 
F;172-204/Domain 
F;205-237/Domain 
F;238-270/Domain 
F;271-303/Domain 
F; 304-336/Domain 
F; 337-369/Domain 
F;370-402/Domain 
F;403-435/Domain 
F;436-468/Domain 
F;469-501/Domain 
F;502-534/Domain 
F;535-567/Domain 
F;568-600/Domain 
F; 601-633/Domain 
F; 634-666/Domain 
F; 667-699/Domain 
F;700-732/Domain 



F;733-765/Domain: ankyrin repeat homology <AN22> 
F; 7 66-7 98/ Domain: ankyrin repeat homology <AN23> 
F;828-1382/Domain: 62K #status predicted <D0M2> 
F; 828- 1382 /Region : spectrin binding 
F;1383-1881/Domain: 55K #status predicted <D0M3> 



Query Match 12.9%; Score 72; DB 1; Length 1881; 

Best Local Similarity 33.3%; Pred. No. 29; 

Matches 23; Conservative 11; Mismatches 27; Indels 8; Gaps 3; 

Qy 20 NTGISLLI PPD — AI PRGKI YEI YLTLHKPEDVRL — PLAGCQTLLSPIVSCGPPGVLLT 75 

: I : : : I I I II I III:: III : I I I :: I I I 

Db 931 HNGLRWIPPRTCAAPT RITCRLVKPQKLSTPPPLAEEEGLASRIIALGPTGAQFL 986 



Qy 76 RPVILAMDH 84 

III: : I 
Db 987 SPVTVEIPH 995 



RESULT 11 
S73122 

carbamoyl phosphate synthase small chain - red alga (Porphyra purpurea) 
chloroplast 

C; Species: chloroplast Porphyra purpurea 

C;Date: 19-Mar-1997 #sequence_revision 09-May-1997 #text_change 09-Jul-2004 

C;Accession: S73122 

R;Reith, M. ; Munholland, J. 

Plant Mol. Biol. Rep. 13, 333-335, 1995 

A; Title: Complete nucleotide sequence of the Porphyra purpurea chloroplast 
genome . 

A;Reference number: S73108 
A;Accession: S73122 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 
A;Molecule type: DNA 
A; Residues: 1-384 <REI> 

A;Cross-references: UNIPROT: P51201; EMBL:U38804; NID : gl276652 ; PIDN: AAC08087 . 1 ; 
PID:gl276667 

A; Note: the nucleotide sequence was submitted to the EMBL Data Library, October 
1995 

C; Genetics : 

A; Gene: car A 

A; Genome : chloroplast 

C;Superfamily: carbamoyl -phosphate synthase (glutamine-hydrolyzing) small chain; 
carbamoyl-phosphate synthase (glutamine-hydrolyzing) small chain homology; trpG 
homology 

C; Keywords : chloroplast 

F; 9-37 3/ Domain : carbamoyl-phosphate synthase (glutamine-hydrolyzing) small chain 
homology <CPS> 

F;197-373/Domain: trpG homology <TRG> 
F;272/Active site: Cys #status predicted 

Query Match 12.5%; Score 70; DB 2; Length 384; 

Best Local Similarity 24.3%; Pred. No. 8.1; 

Matches 27; Conservative 20; Mismatches 26; Indels 38; Gaps 6; 

Qy 17 MIPNTGISLLIPPD— AIP RGKIYE 1 YLTLHKPEDVRL PLAG 56 

Mil ::: I ::| I III : I : : II I 



Db 161 LI PNVTTNIMYDWDEKSLPSWYLADRNREKI YSQLKVI VI DFGVKLNI LR RLATLG 216 

Qy 57 CQTLLSP IVSCGPPGVLLTRPVI LAMDHCGEPS PDSWSLRLKKQ 100 

II : I t : I I I I : I I : : I : I I : : : : I : 

Db 217 CQITVMPASTPTQDILSCKPDGILLS NGPGDPSAVNYGIKTVKE 260 



RESULT 12 
D72240 

tRNA guanine transglycosylase - Thermotoga maritima (strain MSB8) 
C; Species: Thermotoga maritima 

C;Date: ll-Jun-1999 #sequence_revision ll-Jun-1999 #text_change 09-Jul-2004 
C;Accession: D72240 

R;Nelson, K.E.; Clayton, R.A. ; Gill, S.R.; Gwinn, M.L.; Dodson, R.J.; Haft, 
D.H.; Hickey, E.K.; Peterson, J.D.; Nelson, W.C.; Ketchum, K.A. ; McDonald, L. ; 
Utterback, T.R.; Malek, J. A. ; Linher, K.D.; Garrett, M.M. ; Stewart, A.M.; 
Cotton, M.D.; Pratt, M.S.; Phillips, C.A. ; Richardson, D.; Heidelberg, J.; 
Sutton, G.G.; Fleischmann, R.D.; White, O.; Salzberg, S.L.; Smith, H.O.; Venter, 
J.C.; Fraser, CM. 
Nature 399, 323-329, 1999 

A;Title: Evidence for lateral gene transfer between Archaea and Bacteria from 
genome sequence of Thermotoga maritima. 

A; Reference number: A72200; MUID: 99287316; PMID : 10360571 
A; Accession: D72240 
A; Status : preliminary 
A; Molecule type: DNA 
A; Residues: 1-369 <ARN> 

A;Cross-references: UNIPROT:Q9XlP7; GB:AE001801; GB:AE000512; NID: g4982114 ; 
PIDN:AAD36627.1; PID: g4982128 ; TIGR:TM1561 
A; Experimental source: strain MSB8 
C; Genetics : 
A;Gene: TM1561 

C; Superfamily: queuine tRNA-ribosyltransf erase 

Query Match 12.4%; Score 69.5; DB 2; Length 369; 

Best Local Similarity 23.9%; Pred. No. 8.8; 

Matches 26; Conservative 19; Mismatches 35; Indels 29; Gaps 6; 

Qy 7 GTFNFLGGR — LMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDV — RLPLAGCQTLLS 62 

I ||:| : :: : I : ::|: :| : | | | |: | : |: 

Db 75 GLHN FMGWKRP I LT DS GGFQVF SLPKIRIDD EGWFRS P I DGS KVFLN 122 

Qy 63 PIVSCGPPGVLLTRPVILAMDHCGEPSPD SWSLRLKK 99 

I : I : I : : I I I I I I : I I I I 

Db 123 PEISM-EVQIALGSDICMVFDHCPVPDADYEEVKEATERTYRWALRSKK 170 



RESULT 13 
148780 

Stral/Eplg2 protein - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 09-Jul-2004 
C;Accession: 148780; A5 55.07; A55062; S52670 

R;Bouillet, P.; Oulad-Abdelghani, M. ; Vicaire, S.; Gamier, J.M. ; Schuhbaur, B. ; 

Dolle, P.; Chambon, P. 

Dev. Biol. 170, 420-433, 1995 



A;Title: Efficient cloning of cDNAs of retinoic acid-responsive genes in P19 
embryonal carcinoma cells and characterization of a novel mouse gene, Stral 
(mouse LERK-2/Eplg2) . 

A; Reference number: 148780; MUID: 95377533; PMID:7649373 
A;Accession: 148780 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-345 <RES> 

A; Cross-references: UNIPROT : P52795 ; EMBL:Z48781; NID:g747858; PIDN : CAA88695 . 1 ; 
PID:g747859 

R; Fletcher, F.A. ; Renshaw, B.; Hollingsworth, T.; Baum, P.; Lyman, S.D.; 
Jenkins, N.A. ; Gilbert, D.J.; Copeland, N.G.; Davison, B.L. 
Genomics 24, 127-132, 1994 

A;Title: Genomic organization and chromosomal localization of mouse Eplg2, a 

gene encoding a binding protein for the receptor tyrosine kinase Elk. 

A; Reference number: A55507; MUID: 95203867 ; PMID: 7896266 

A;Accession: A55507 

A; Status : preliminary 

A; Molecule type: DNA 

A; Residues: 1-345 <FLE> 

A; Cross-references : GB:U07598 

R;Shao, H.; Lou, L. ; Pandey, A.; Pasquale, E.B.; Dixit, V.M. 
J. Biol. Chem. 269, 26606-26609, 1994 

A; Title: cDNA cloning and characterization of a ligand for the Cek5 receptor 
protein-tyrosine kinase. 

A; Reference number: A55062; MUID: 95014510; PMID:7929389 
A; Accession: A55062 

A; Status: preliminary; not compared with conceptual translation 

A; Molecule type: mRNA 

A; Residues: 1-89, 1 T 1 , 91-345 <SHA> 

A; Cross-references: GB:U12983; NID:g575928; PIDN : AAA53231 . 1 ; PID:g575929 
C; Genetics : 
A; Gene: EPLG2 

Query Match 12.3%; Score 69; DB 2; Length 345; 

Best Local Similarity 30.9%; Pred. No. 9.3; 

Matches 29; Conservative 11; Mismatches 40; Indels 14; Gaps 5 

Qy 11 FLGGR — LMIPNTGISLLIPPDAIPRGKIYEIY-LTLHKPEDVRLPLAGCQTLLSP — IV 65 

III: : : I I II I : I I I I I : I I I I hi I : I 

Db 44 FLSGKGLVIYPKIGDKLDIICPRAEAGRPYEYYKLYLVRPEQA AACSTVLDPNVLV 99 



Qy 66 SCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKK 99 

: I I : : III: III 

Db 100 TCNKPHQEIRFTIKFQ EFS PNYMGLEFKK 128 



RESULT 14 
158406 

LERK-2 - rat 

C; Species: Rattus norvegicus (Norway rat) 

C;Date: 26-Jul-1996 #sequence_revision 26-Jul-1996 #text_change 09-Jul-2004 
C; Accession: 158406 

R;Fletcher, F.A. ; Carpenter, M. ; Shilling, H.; Baum, P.; Ziegler, S.; Gimpel, 
S.; Hollingsworth, T.; VandenBos, T.; Davison, B.L.; Lyman, S.D.; Beckmann, M. 
Oncogene 9, 3241-3248, 1994 



A; Title: LERK-2, a ligand for the receptor tyrosine kinase ELK, is 
evolutionarily conserved and expressed in a developmentally regulated pattern. 
A; Reference number: 158406; MUID : 95022634 ; PMID: 7936648 
A;Accession: 158406 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-345 <RES> 

A; Cross-references: UNIPROT : P52796; EMBL:U07560; NID:g563118; PIDN :AAA53 092 . 1 ; 
PID:g563119 
C; Genetics : 
A; Gene : Eplg2 

Query Match 12.3%; Score 69; DB 2; Length 345; 

Best Local Similarity 31.9%; Pred. No. 9.3; 

Matches 30; Conservative 10; Mismatches 40; Indels 14; Gaps 5; 

Qy 11 FLGGR — LMIPNTGISLLIPPDAIPRGKIYEIY-LTLHKPEDVRLPLAGCQTLLSP — IV 65 

III: : : I I II I : I I I I I : I I I I I : I I : I 

Db 44 FLSGKGLVTYPKIGDKLDIICPRAEAGRPYEYYKLYLVRPEQA AACSTVLDPNVLV 99 

Qy 66 SCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKK 99 

: I I : I III: III 

Db 100 TCNKP QQEI RFTI KFQEFS PNYMGLEFKK 128 



RESULT 15 
S46993 

elk ligand - human 

C; Species: Homo sapiens (man) 

C;Date: 15-Jul-1995 ffsequence_revision 10-Nov-1995 #text_change 09-Jul-2004 
C; Accession: S46993 

R;Beckmann, M.P.; Cerretti, D.P.; Baum, P.; vanden Bos, T.; James, L. ; Far rah, 
T.; Kozlosky, C; Hollingsworth, T.; Shilling, H.; Maraskovsky, E. ; Fletcher, 
F.A. ; Lhotak, V.; Pawson, T. ; Lyman, S.D. 
EMBO J. 13, 3757-3762, 1994 

A; Title: Molecular characterization of a family of ligands for eph-related 
tyrosine kinase receptors . 

A; Reference number: S46993; MUID : 94349923 ; PMID: 8070404 
A; Accession: S46993 
A; Status : preliminary 
A; Molecule type: mRNA 
A; Residues: 1-346 <BEC> 

A; Cross-references: UNIPROT : P98172 ; GB:U09304; NID:g538366; PIDN : AAA53093 . 1; 
PID:g538367 

Query Match 12.2%; Score 68; DB 2; Length 346; 

Best Local Similarity 30.9%; Pred. No. 12; 

Matches 29; Conservative 11; Mismatches 40; Indels 14; Gaps 5; 

Qy 11 FLGGR — LMI PNTGI SLLI PPDAI PRGKI YEI Y-LTLHKPEDVRLPLAGCQTLLSP — IV 65 

III: : : I I II I : I I I I I : I I I I hi I : I 

Db 44 FLSGKGLVIYPKIGDKLDIICPRAEAGRPYEYYKLYLVRPEQA AACSTVLDPNVLV 99 

Qy 66 SCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKK 99 

: I I : : III: III 

Db 100 TCNRPEQEIRFTIKFQ EFS PNYMGLEFKK 128 



Search completed: March 1, 2005, 09:07:22 
Job time : 5.05992 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



March 1, 2005, 09:06:01 ; Search time 20.0163 Seconds 

(without alignments) 
1704.439 Million cell updates/sec 

US-10-624-932-2_COPY_495_598 
559 

1 TSNMTYGTFNFLGGRLMIPN CGEPSPDSWSLRLKKQSCEG 104 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 1385339 seqs, 328044528 residues 

Total number of hits satisfying chosen parameters: 



1385339 



Minimum^DB seq length: 
Maximum DB seq length: 



2000000000 



Post-processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database 



Published_Applications_AA: * 

1 : /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB . pep : * 

2: /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB.pep:* 

3: /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB.pep:* 

4 : /cgn2_6/ptodata/l/pubpaa/US06_PUBCOMB.pep: * 

5 : /cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB . pep : * 

6: /cgn2_6/ptodata/l/pubpaa/PCTUS_PUBCOMB.pep: 

7 : /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB .pep : * 

8 : /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB .pep : * 

9 : /cgn2_6/ptodata/l/pubpaa/US09A_PUBCOMB .pep : 
10: /cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB.pep 
11: /cgn2_6/ptodata/l/pubpaa/US09C_PUBCOMB.pep 
12: /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB.pep: 
13: / cgn2_6/pt odata/ 1 /pubpaa/US 1 0A_PUBCOMB . pep 
14: /cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB.pep 
15: /cgn2_6/ptodata/l/pubpaa/US10C_PUBCOMB.pep 
16: /cgn2_6/ptodata/l/pubpaa/US10D_PUBCOMB.pep 
17: /cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB.pep 
18: / cgn2_6/pt odata/ 1/pubpaa /US 1 1_NEW_PUB . pep 
19: /cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB.pep 
20: /cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB.pep 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



% 



SUMMARIES 



Result Query 



No. 


Score 


Match Length 


DB 


ID 




Description 


1 


559 


100. 


0 


544 


10 


us- 


09-970-944-14 


Sequence 


14, Appl 


2 


559 


100. 


0 


842 


15 


us- 


10-311-623-1 


Sequence 


1, Appli 


3 


559 


100. 


0 


898 


10 


us- 


09-918-779-2 


Sequence 


2, Appli 


4 


559 


100. 


0 


898 


15 


us- 


10-624-932-2 


Sequence 


2, Appli 


5 


559 


100. 


0 


899 


10 


us- 


09-970-944-2 


Sequence 


2, Appli 


6 


555 


99. 


3 


436 


15 


us- 


10-296-115-1104 


Sequence 


1104, Ap 


7 


553 


98. 


9 


557 


10 


us- 


09-933-261-6 


Sequence 


6, Appli 


8 


553 


98. 


9 


557 


14 


us- 


10-256-702-6 


Sequence 


6, Appli 


9 


553 


98. 


9 


898 


10 


us- 


09-933-261-5 


Sequence 


5, Appli 


10 


553 


98. 


9 


898 


10 


us- 


09-970-944-13 


Sequence 


13, Appl 


1 1 

X 1 


u -j o 


98. 


9 


898 


1 4 
x *± 




1 D-2 Sfi-702-S 


Sequence 


5, Appli 


1 2 


■J J J 


98. 


9 


898 


1 4 
x *± 


IIS- 

VJ tj 


1 D-24 0-1 S4-1 6 


Sequence 


16, Appl 




^4 ft 


62. 


3 


931 


X VJ 


VJ o 


0Q-Q7n-Q44-1 S 


Sequence 


15, Appl 


1 4 

X 1 


^4 ft 


62. 


3 


931 


X X 


IIS- 

VJ fcj 


09-972-211-121 

VJ 3* J 1 X X X ^- X 


Sequence 


121, App 


15 


348 


62. 


3 


931 


15 


us- 


10-087-684-35 


Sequence 


35, Appl 


1 

X u 


348 


62. 


3 


931 


15 


us- 


10-037-417-117 

XVJ VJ J / *1X/ XX/ 


Sequence 


117, App 


1 7 

X / 


^4 ft 

O 1 o 


62. 


3 


931 


1 s 

X «J 


vj o 


1 n-DQfi-fi2 S- 1 21 

XVJ VJ 27 VJ VJ «J X^.X 


Sequence 


121, App 


1 ft 

X o 


^4 ft 
»J *± o 


62. 


3 


1010 


1 S 

X _> 


IIS- 

VJ tj 


1 0-21 8-77Q-^S 


Sequence 


35, Appl 


X J 


^7 


60. 


3 


931 


x vj 


US- 


nq_q7n-Q44-1 6 

\J_7 -7/VJ _7TT X\J 


Sequence 


16, Appl 


20 


337 


60. 


3 


931 


10 


us- 


09-970-944-17 


Sequence 


17, Appl 


21 

^ X 


337 


60. 


3 


931 


11 


US- 


09-972-211-122 


Sequence 


122, App 


22 


^7 


60. 


3 


931 


1 1 

X X 


Vj o 


nq-Q72-21 1 -1 2 S 

VJ _/ Z? 1 ill X^^J 


Sequence 


125, App 


2^ 

-j 


^7 

.j «j / 


60. 


3 


931 


X —> 


IIS- 


i n-n87-fi84-^fi 

XVJ VJ U / VJ O M -J VJ 


Sequence 


36, Appl 


24 


^7 


60. 


3 


931 


1 S 

X J 


us- 


10-218-779-36 


Sequence 


36, Appl 


2 S 


"3 "37 


60. 


3 


931 


15 


us- 


10-037-417-118 


Sequence 


118, App 




^^7 


60. 


3 


931 


X J 


us- 


10-037-417-119 


Sequence 


119, App 


97 
^- / 


^7 


60. 


3 


931 


1 S 


us- 


10-037-417-120 


Sequence 


120, App 




3^7 


60. 


3 


931 


X -J 


us- 


10-096-625-122 


Sequence 


122, App 


2Q 


^^7 


60. 


3 


931 


X *J 


us- 


10-096-625-125 


Sequence 


125, App 


30 


287 


51. 


3 


223 


9 


US-09-764-870-436 


Sequence 436, App 


^1 

•J X 


287 


51. 


3 


223 


1 4 

X " 


us- 


10-125-540-436 


Sequence 


436, App 


1? 


287 


51. 


3 


554 


X -J 


us- 


10-108-260A-2682 


Sequence 


2682, Ap 


J o 


287 


51. 


3 


933 


1 S 

X *J 


us- 


10-087-684-2 


Sequence 


2, Appli 


^4 
-j i 


287 


51. 


3 


933 


15 

X w/ 


us- 


10-087-684-4 


Sequence 


4, Appli 


35 


287 


51. 


3 


933 


15 


us- 


10-218-779-2 


Sequence 


2, Appli 


O VJ 


287 


51. 


3 


933 


1 s 


us- 


10-218-779-4 


Sequence 


4, Appli 


^7 


287 


51. 


3 


945 


14 


us- 


10-028-072-146 


Sequence 


146, App 


^8 


287 


51. 


3 


945 


1 4 

X *i 


us- 


10-140-808-146 


Sequence 


146, App 


39 


287 


51. 


3 


945 


14 


us- 


10-121-049-146 


Sequence 


146, App 


40 


287 


51. 


3 


945 


14 


us- 


10-123-904-146 


Sequence 


146, App 


41 


287 


51. 


3 


945 


14 


us- 


10-140-470-146 


Sequence 


146, App 


42 


287 


51. 


3 


945 


14 


us- 


10-175-746-146 


Sequence 


146, App 


43 


287 


51. 


3 


945 


14 


us- 


10-176-918-146 


Sequence 


146, App 


44 


287 


51. 


3 


945 


14 


us- 


10-176-921-146 


Sequence 


146, App 


45 


287 


51. 


3 


945 


14 


us- 


10-137-865-146 


Sequence 


146, App 



ALIGNMENTS 



RESULT 1 

US-09-970-944-14 

; Sequence 14, Application US/09970944 
; Publication No. US20030204052A1 



; GENERAL INFORMATION: 

; APPLICANT: Herman, John L 

; APPLICANT: Rastelli, Luca 

; APPLICANT: Shimkets, Richard A 

; TITLE OF INVENTION: No. US20030204 052Alel Proteins and Nucleic Acids Encoding 
Same and 

TITLE OF INVENTION: Antibodies Directed Against these Proteins 
; FILE REFERENCE: 21402-138 

; CURRENT APPLICATION NUMBER: US/09/970,944 

; CURRENT FILING DATE: 2002-05-02 

; PRIOR APPLICATION NUMBER: 60/237,862 

; PRIOR FILING DATE: 2000-10-04 

; NUMBER OF SEQ ID NOS : 62 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 14 

LENGTH: 544 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-970-944-14 



Query Match 100.0%; Score 559; DB 10; Length 544; 

Best Local Similarity 100.0%; Pred. No. 2.6e-56; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 141 TSNMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 200 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 201 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 244 



RESULT 2 
US-10-311-623-1 

Sequence 1, Application US/10311623 
Publication No. US20040023244A1 
GENERAL INFORMATION: 
APPLICANT: INCYTE GENOMICS, INC.; GRIFFIN, Jennifer A. 
APPLICANT: KALLICK, Deborah A. ; TRIBOULEY, Catherine M. 
APPLICANT: YUE, Henry; NGUYEN, Danniel B. 
APPLICANT: TANG, Y. Tom; LAL, Preeti G. 
APPLICANT: POLICKY, Jennifer L. ; AZIMZAI, Yalda 
APPLICANT: LU, Dyung Aina M. ; GRAUL, Richard C. 
APPLICANT: YAO, Monique G. ; BURFORD, Neil 
APPLICANT: HAFALIA, April J. A.; BAUGHN, Mariah R. 
APPLICANT: BANDMAN, Olga; ARVIZU, Chandra S. 
APPLICANT: YANG, Junming; XU, Yuming 
APPLICANT: GANDHI, Ameena R. ; WARREN, Bridget A. 
APPLICANT: DING, Li; SANJANWALA, Madhusudan M. 
APPLICANT: DUGGAN, Brendan M. ; LU, Yan 
TITLE OF INVENTION: RECEPTORS 
FILE REFERENCE: PF-0793 USN 
CURRENT APPLICATION NUMBER: US/10/311,623 
CURRENT FILING DATE: 2002-12-17 
PRIOR APPLICATION NUMBER: US 01/19942 
PRIOR FILING DATE: 2001-06-21 



PRIOR APPLICATION NUMBER: US 60/214,027 
PRIOR FILING DATE: 2000-06-21 
PRIOR APPLICATION NUMBER: US 60/228,045 
PRIOR FILING DATE: 2000-08-25 
PRIOR APPLICATION NUMBER: US 60/255,104 
PRIOR FILING DATE: 2000-12-12 
NUMBER OF SEQ ID NOS : 24 
SOFTWARE: PERL Program 
SEQ ID NO 1 
LENGTH: 842 
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE: 

NAME/KEY: misc_feature 

OTHER INFORMATION: Incyte ID No. US20040023244A1 6052371CD1 
US-10-311-623-1 

Query Match 100.0%; Score 559; DB 15; Length 842; 

Best Local Similarity 100.0%; Pred. No. 4.5e-56; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

, I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 439 TSNMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 498 

Qy 61 LSPIVSCGPPGVXLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 499 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 542 



RESULT 3 
US-09-918-779-2 

; Sequence 2, Application US/09918779 
; Publication No. US20030064369A1 
; GENERAL INFORMATION: 



APPLICANT: 


Taupier, 


Raymond 


APPLICANT: 


Padigaru 


, Muralidhara 


APPLICANT: 


Rastelli 


, Luca 


APPLICANT: 


Spaderna 


, Steven 


APPLICANT: 


Shimkets 


, Richard 


APPLICANT: 


Zerhusen 


, Bryan 


APPLICANT: 


Spytek, 


Kimberly 


APPLICANT: 


Shenoy, 


Suresh 


APPLICANT: 


Li, Li 




APPLICANT: 


Gusev, Vladimir 


APPLICANT: 


Grosse, 


William 


APPLICANT: 


Alsobrook, John 


APPLICANT: 


Lepley, 


Denise 


APPLICANT: 


Burgess, 


Catherine 


APPLICANT: 


Gerlach, 


Valerie 


APPLICANT: 


Ellerman 


, Karen 


APPLICANT: 


MacDougall, John 


APPLICANT : 


Stone, David 


APPLICANT: 


Smithson 


, Glennda 



TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same 

FILE REFERENCE: 21402-074 US 

CURRENT APPLICATION NUMBER: US/09/918,779 



; CURRENT FILING DATE: 2001-07-30 

; PRIOR APPLICATION NUMBER: 60/221,409 

; PRIOR FILING DATE: 2000-07-28 

; PRIOR APPLICATION NUMBER: 60/222,840 

; PRIOR FILING DATE: 2000-08-04 

; PRIOR APPLICATION NUMBER: 60/223,752 

; PRIOR FILING DATE: 2000-08-08 

; PRIOR APPLICATION NUMBER: 60/223,762 

; PRIOR FILING DATE: 2000-08-08 

; PRIOR APPLICATION NUMBER: 60/223,770 

; PRIOR FILING DATE: 2000-08-08 

; PRIOR APPLICATION NUMBER: 60/223,769 

; PRIOR FILING DATE: 2000-08-08 

; PRIOR APPLICATION NUMBER: 60/225,146 

; PRIOR FILING DATE: 2000-08-14 

; PRIOR APPLICATION NUMBER: 60/225,392 

; PRIOR FILING DATE: 2000-08-15 

; PRIOR APPLICATION NUMBER: 6.0/225,470 

; PRIOR FILING DATE: 2000-08-15 

; PRIOR APPLICATION NUMBER: 60/225,697 

; PRIOR FILING DATE: 2000-08-16 

; PRIOR APPLICATION NUMBER: 60/263,662 

; PRIOR FILING DATE: 2001-02-01 

; PRIOR APPLICATION NUMBER: 60/281,645 

; PRIOR FILING DATE: 2001-04-05 

; NUMBER OF SEQ ID NOS : 61 

; SOFTWARE: Patent In Ver. 2.1 

; SEQ ID NO 2 

LENGTH: 898 

; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-918-779-2 



Query Match 100.0%; Score 559; DB 10; Length 898; 

Best Local Similarity 100.0%; Pred. No. 4.9e-56; 

Matches 104; Conservative 0; Mismatches 0; „ Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 495 TSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 554 



Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 555 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 4 
US-10-624-932-2 

Sequence 2, Application US/10624932 
Publication No. US20040096877A1 
GENERAL INFORMATION: 
APPLICANT: Taupier, Raymond 
APPLICANT: Padigaru, Muralidhara 
APPLICANT: Rastelli, Luca 
APPLICANT: Spaderna, Steven 
APPLICANT: Shimkets, Richard 
APPLICANT: Zerhusen, Bryan 



APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 



Spytek, Kimberly 
Shenoy, Suresh 
Li, Li 

Gusev, Vladimir 
Grosse, William 
Alsobrook, John 
Lepley, Denise 
Burgess , Catherine 
Gerlach, Valerie 
Ellerman, Karen 
MacDougall, John 



Stone, David 
Smithson, Glennda 
TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same 
FILE REFERENCE: 21402-074 US 
CURRENT APPLICATION NUMBER: US/10/624 , 932 
CURRENT FILING DATE: 2003-07-21 
PRIOR APPLICATION NUMBER: 09/918,779 
PRIOR FILING DATE: 2001-07-03 
PRIOR APPLICATION NUMBER: 60/221,409 
PRIOR FILING DATE: 2000-07-28 
PRIOR APPLICATION NUMBER: 60/222,840 
PRIOR FILING DATE: 2000-08-04 
PRIOR APPLICATION NUMBER: 60/223,752 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,762 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,770 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,769 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/225,146 
PRIOR FILING DATE: 2000-08-14 
PRIOR APPLICATION NUMBER: 60/225,392 
PRIOR FILING DATE: 2000-08-15 
PRIOR APPLICATION NUMBER: 60/225,470 
PRIOR FILING DATE: 2000-08-15 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 61 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 2 
LENGTH: 898 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-624-932-2 



Query Match 100.0%; Score 559; DB 15 

Best Local Similarity 100.0%; Pred. No. 4.9e-56 
Matches 104; Conservative 0; Mismatches 0 



Length 898; 

Indels 0; Gaps 0; 



Qy 

Db 



1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
495 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 



Qy 

Db 



61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I 
555 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 5 
US-09-970-944-2 

; Sequence 2, Application US/09970944 

; Publication No. US20030204052A1 

; GENERAL INFORMATION: 

; APPLICANT: Herman, John L 

; APPLICANT: Rastelli, Luca 

; APPLICANT: Shimkets, Richard A 

; TITLE OF INVENTION: No. US20030204052Alel Proteins and Nucleic Acids Encoding 
Same and 

; TITLE OF INVENTION: Antibodies Directed Against these Proteins 
; FILE REFERENCE:. 21402-138 

; CURRENT APPLICATION NUMBER: US/09/970,944 

; CURRENT FILING DATE: 2002-05-02 

; PRIOR APPLICATION NUMBER: 60/237,862 

; PRIOR FILING DATE: 2000-10-04 

; NUMBER OF SEQ ID NOS : 62 

; SOFTWARE: Patent In Ver. 2.1 

; SEQ ID NO 2 

LENGTH: 899 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-970-944-2 

Query Match 100.0%; Score 559; DB- 10; Length 899; 

Best Local Similarity 100.0%; Pred. No. 4.9e-56; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 495 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LS PI VSCGPPGVLLTRPVI LAMDHCGEPS PDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 555 LS P I VSCGPPGVLLTRPVI LAMDHCGEPS PDSWSLRLKKQSCEG 598 



RESULT 6 " 
US-10-296-115-1104 

; Sequence 1104, Application US/10296115 
; Publication No. US20040053248A1 
; GENERAL INFORMATION: 
; APPLICANT: Hyseq Inc 

; TITLE OF INVENTION: No. US20040053248Alel Nucleic Acids and Polypeptides 
; FILE REFERENCE: 784PCT 

; CURRENT APPLICATION NUMBER: US/10/296,115 

; CURRENT FILING DATE: 2002-11-18 

; PRIOR APPLICATION NUMBER: US09/488,725 

; PRIOR FILING DATE: 2000-01-21 

; PRIOR APPLICATION NUMBER: US09/552,317 

; PRIOR FILING DATE: 2000-04-25 

; NUMBER OF SEQ ID NOS: 1478 

; SEQ ID NO 1104 

LENGTH: 436 

TYPE: PRT 



; ORGANISM: Homo sapiens 
US-10-296-115-1104 



Query Match 99.3%; Score 555; DB 15; Length 436; 

Best Local Similarity 99.0%; Pred. No. 5.9e-56; 

Matches 103; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I II I 



LSPIVSCGPPGVXLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 
I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
LSPIVSCGPPGVLLTRPVILGMDHCGEPSPDSWSLRLKKQSCEG 135 



Qy 


1 


Db 


32 


Qy 


61 


Db 


92 



RESULT 7 
US-09-933-261-6 

; Sequence 6, Application US/09933261 
; Publication No. US20030040046A1 

GENERAL INFORMATION: 
; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo , E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

; TITLE OF INVENTION: Netrin Receptors 

NUMBER OF SEQUENCES : 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW. GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0 , Version #1.30 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/933,261 
; FILING DATE: 20-Aug-2001 

CLASSIFICATION: <Unknown> 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: 08/808,982 

FILING DATE: <Unknown> 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627' 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 6: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 557 amino acids 



TYPE: amino acid 

STRANDEDNESS : No. US20030040046A1 Relevant 
TOPOLOGY: No. US20030040046A1 Relevant 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
US-09-933-261-6 

Query Match 98.9%; Score 553; DB 10; Length 557; 

Best Local Similarity 99.0%; Pred. No. 1.4e-55; 

Matches 103; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 153 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 212 

Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I 
Db 213 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLALKKQSCEG 256 



RESULT 8 
US-10-256-702-6 

; Sequence 6, Application US/10256702 
; Publication No. US20030059859A1 
; GENERAL INFORMATION: 

; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo, E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & . TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/256,702 
FILING DATE: 27-Sep-2002 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/09/933,261 
; FILING DATE: 20-Aug-2001 

APPLICATION NUMBER: 08/808,982 
; FILING DATE: <Unknown> 

ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 



TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 557 amino acids 

; TYPE: amino acid 

STRANDEDNESS: No. US20030059859A1 Relevant 
TOPOLOGY: No. US20030059859A1 Relevant 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
US-10-256-702-6 



Query Match 98.9%; Score 553; DB 14; Length 557; 

Best Local Similarity 99.0%; Pred. No. 1.4e-55; 

Matches 103; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 153 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 212 

Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 213 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLALKKQSCEG 256 



RESULT 9 
US-09-933-261-5 

; Sequence 5, Application US/09933261 
; Publication No. US20030040046A1 

GENERAL INFORMATION: 
; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo, E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 BUSH STREET, SUITE 3200 

CITY: SAN FRANCISCO 

STATE: CALIFORNIA 

COUNTRY: USA 
; ZIP: 94104 

; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/09/933,261 

FILING DATE: 20-Aug-2001 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/808,982 

FILING DATE: <Unknown> 



ATTORNEY/ AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 898 amino acids 
TYPE: amino acid 

STRANDEDNESS: No. US2003004 004 6A1 Relevant 
TOPOLOGY: No. US20030040046A1 Relevant 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-933-261-5 

Query Match 98.9%; Score 553; DB 10; Length 898; 

Best Local Similarity 98.1%; Pred. No. 2.5e-55; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 495 TSNMAYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I : I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 555 LSPVVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 10 
US-09-970-944-13 

; Sequence 13, Application US/09970944 

; Publication No. US20030204052A1 

; GENERAL INFORMATION: 

; APPLICANT: Herrman, John L 

; APPLICANT: Rastelli, Luca 

; APPLICANT: Shimkets, Richard A 

; TITLE OF INVENTION: No. US20030204052Alel Proteins and Nucleic Acids Encoding 
Same and 

TITLE OF INVENTION: Antibodies Directed Against these Proteins 
; FILE REFERENCE: 21402-138 

; CURRENT APPLICATION NUMBER: US/09/970, 944 
; CURRENT FILING DATE: 2002-05-02 
; PRIOR APPLICATION NUMBER: 60/237,862 
; PRIOR FILING DATE: 2000-10-04 
; NUMBER OF SEQ ID NOS : 62 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 13 

LENGTH: 898 
; TYPE: PRT 

; ORGANISM: Rattus norvegicus 
US-09-970-944-13 

Query Match 98.9%; 
Best Local Similarity 98.1%; 
Matches 102; Conservative 



Score 553; DB 10; Length 898; 
Pred. No. 2.5e-55; 
1; Mismatches 1; Indels 0; Gaps 0; 



Qy 

Db 



1 

495 



TSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
TSNMAYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 554 



Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 555 LSPWSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 11 
US-10-256-702-5 

; Sequence 5, Application US/10256702 
; Publication No. US20030059859A1 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
; Leonardo/ E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/10/256,702 

FILING DATE: 27-Sep-2002 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/09/933,261 
FILING DATE: 20-Aug-2001 
APPLICATION NUMBER: 08/808,982 
; FILING DATE: <Unknown> 

ATTORNEY/ AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 898 amino acids 
TYPE: amino acid 

STRANDEDNESS: No. US20030059859A1 Relevant 
; TOPOLOGY: No. US20030059859A1 Relevant 

; MOLECULE TYPE: peptide 



SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-10-256-702-5 



Query Match 98.9%; Score 553; DB 14; Length 898; 

Best Local Similarity 98.1%; Pred. No. 2.5e-55; 

Matches 102; Conservative p 1; Mismatches 1; Indels 0; Gaps 0 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 495 TSNMAYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 555 LSPVVSCGPPGVLLTRPVT LAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 12 
US-10-240-154-16 

Sequence 16, Application US/10240154 
Publication No. US20030175741A1 
GENERAL INFORMATION: 
APPLICANT: Cochran et al. 

TITLE OF INVENTION: SCHIZOPHRENIA RELATED GENES 
FILE REFERENCE: CKFW-P01-006 
CURRENT APPLICATION NUMBER: US/10/240, 154 
CURRENT FILING DATE: 2001-04-02 
PRIOR APPLICATION NUMBER: PCT/GB01/0148 6 
PRIOR FILING DATE: 2001-04-02 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE: Patentln version 3.2 
SEQ ID NO 16 
LENGTH: 898 
TYPE: PRT 

ORGANISM: Rattus sp. 
US-10-240-154-16 

Query Match 98.9%; Score 553; DB 14; Length 898; 

Best Local Similarity 98.1%; Pred. No. 2.5e-55; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 495 TSNMAYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LS PI VSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 555 LSPWSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 13 
US-09-970-944-15 

; Sequence 15, Application US/09970944 

; Publication No. US20030204052A1 

; GENERAL INFORMATION: 

; . APPLICANT: Herrman, John L 

; APPLICANT: Rastelli, Luca 

; APPLICANT: Shimkets, Richard A 



; TITLE OF INVENTION: No. US20030204052Alel Proteins and Nucleic Acids Encoding 
Same and 

; TITLE OF INVENTION: Antibodies Directed Against these Proteins 
; FILE REFERENCE: 21402-138 

; CURRENT APPLICATION NUMBER : US/09/970,944 

; CURRENT FILING DATE: 2002-05-02 

; PRIOR APPLICATION NUMBER: 60/237,862 

; PRIOR FILING DATE: 2000-10-04 

; NUMBER OF SEQ ID NOS : 62 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 15 

LENGTH: 931 
; TYPE: PRT 

; ORGANISM: Caenorhabditis elegans 
US-09-970-944-15 



Query Match 62.3%; Score 348; DB 10; Length 931; 

Best Local Similarity 58.6%; Pred. No. 2.1e-31; 

Matches 58; Conservative 23; Mismatches 18; Indels 0; Gaps 0; 

Qy 6 YGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIV 65 

: I I I I III I : I I I : I : I II I I I I I : I : : I I : I : I : I : I : : I I : I I I I : I : I 
Db 533 FGTFNSLGGHLIIPNSGVSLLIPAGAIPQGRVYEMYVTVHRKENMRPPMEDSQTLLTPW 592 

Qy 66 SCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

II I I I I I I I I I I I I : II : I I : I : : I I I : : I 
Db 593 SCGPPGALLTRPVT LTLHHCADPSTEDWKIQLKNQAVQG 631 



RESULT 14 
US-09-972-211-121 

Sequence 121, Application US/09972211 
Publication No. US20040048245A1 
GENERAL INFORMATION: 
APPLICANT: Shimkets, Richard A 
APPLICANT: Taupier Jr, Raymond J 
APPLICANT: Burgess, Catherine E 
APPLICANT: Zerhusen, Bryan D 
APPLICANT: Mezes, Peter S 
APPLICANT: Rastelli, Luca 
APPLICANT: Malyankar, Uriel M 
APPLICANT: Grosse, William M 
APPLICANT: Alsobrook II, John P 
APPLICANT: Lepley, Denise M 
APPLICANT: Spytek, Kimberly Ann 
APPLICANT: Li, Li 
APPLICANT: Edinger, Shlomit 
APPLICANT: Gerlach, Valerie 
APPLICANT: Ellerman, Karen 
APPLICANT: MacDougall, John R 
APPLICANT: Gunther, Erik 
APPLICANT: Millet, Isabelle 
APPLICANT: Stone, David J 
APPLICANT: Smithson, Glennda 
APPLICANT: Szekeres Jr, Edward S 

TITLE OF INVENTION: No. US20040048245Alel Human Proteins, Polynucleotides 
Encoding Them And 



; TITLE OF INVENTION: Methods Of Using The Same 

; FILE REFERENCE: 21402-141 

; CURRENT APPLICATION NUMBER: US/09/972,211 

; CURRENT FILING DATE: 2001-10-05 

; PRIOR APPLICATION NUMBER: 60/238,325 

; PRIOR FILING DATE: 2000-10-05 

; PRIOR APPLICATION NUMBER: 60/238,323 

; PRIOR FILING DATE: 2000-10-05 

; PRIOR APPLICATION NUMBER: 60/238,400 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,397 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,401 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,379 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,402 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 30/238,384 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,373 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,372 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,383 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,382 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/275,892 

; PRIOR FILING DATE: 2001-03-14 

; PRIOR APPLICATION NUMBER: 60/296,860 

; PRIOR FILING DATE: 2001-06-08 

; NUMBER OF SEQ ID NOS : 198 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 121 

LENGTH: 931 

TYPE: PRT 

; ORGANISM: Mus mus cuius 
US-09-972-211-121 



Query Match 62.3%; Score 348; DB 11; Length 931; 

Best Local Similarity 58.6%; Pred. No. 2.1e-31; 

Matches 58; Conservative 23; Mismatches 18; Indels 0; Gaps 0 

Qy 6 YGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTLLSPIV 65 

: I I I I III I : I I I : I : I I I II I I I : I : : I I : I : I : I : I : : I I : I I I I : I : I 
Db 533 FGTFNSLGGHLIIPNSGVSLLIPAGAIPQGRVYEMYVTVHRKENMRPPMEDSQTLLTPW 592 

Qy 66 SCGPPGVLLTRPVT LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I ! I I I I I I I I : I I : I I : [ : : I I I : : I 
Db 593 SCGPPGALLTRPVI LTLHHCADPSTEDWKIQLKNQAVQG 631 



RESULT 15 
US-10-087-684-35 

; Sequence 35, Application US/10087684 
; Publication No. US20040029116A1 



GENERAL INFORMATION: 



APPLICANT 

AT till V^iUN X 


Edinger, 


Shlomit R. 


APPLICANT 
ru rui v^iui x 


MacDougall, John R. 


APPLICANT 


Millet, 


Isabelle 


ADDT TPAMT 
/\ir 1: Ijx, L.rMN 1 


Ellerman, Karen 


APPLICANT 


Stone, 


David J. 


APPLICANT 


Grosse, 


William M. 




Lepley, 


Denise M. 


APPT TCAMT 
.rvr IT xjx. wt±n 1 


Rieger, 


Daniel K. 


ADDT TrMTT 
r\c rLl U/\i>J 1 


Burgess 


, Cathereine E. 


ADDT T fZVMT 
r\Jz rLl 1 


Casman, 


Stacie, J. 


a dpt Tfaxrp 


Spytek, 


Kimberly A. 


a dpt TraMT 

/\X7 c LiL\*r\r\ 1 


Boldog, 


Ferenc L. 


ADDT TrQMT 


Li, Li 




}\r r J_i 1 L./\Jn 1 


Padigaru, Muralidhara 


7v ppT TPZVMT 


Mishra, 


Vishnu 


ZVPPT TfaMT 


Shenoy, 


Suresh G. 


ADDT TPZiMT « 


Rastelli, Luca 


APPT.TCANT 
rAxr rLl ^tmn x 


Tchernev, Velizar T. 


APPLICANT 


Vernet, 


Corine A.M. 


APPLICANT 


Zerhusen, Bryan D. 


APPLICANT 


Malyankar, Uriel M. 


APPLICANT 


Guo, Xiaojia 


APPLICANT 


Miller, 


Charles E. 


APPLICANT 


Gangoll 


i, Esha A. 



; TITLE OF INVENTION: PROTEINS AND NUCLEIC ACIDS ENCODING SAME 

; FILE REFERENCE: 21402-214 CIP 

; CURRENT APPLICATION NUMBER: US/10/087,684 

; CURRENT FILING DATE: 2003-03-10 

; PRIOR APPLICATION NUMBER: 60/253,834 

; PRIOR FILING DATE: 2000-11-29 

; PRIOR APPLICATION NUMBER: 60/250,926 

; PRIOR FILING DATE: 2000-11-30 

; PRIOR APPLICATION NUMBER: 60/264,180 

; PRIOR FILING DATE: 2001-01-25 

; PRIOR APPLICATION NUMBER: 60/274,194 

; PRIOR FILING DATE: 2001-03-08 

; PRIOR APPLICATION NUMBER: 60/313,656 

; PRIOR FILING DATE: 2001-08-20 

; PRIOR APPLICATION NUMBER: 60/327,456 

; PRIOR FILING DATE: 2001-10-05 

; NUMBER OF SEQ ID NOS:220 

; SOFTWARE: CuraSeqList version 0.1 

; SEQ ID NO 35 

LENGTH: 931 

TYPE: PRT 

; ORGANISM: Mus mus cuius 
US-10-087-684-35 

Query Match 62.3%; Score 348; DB 15; Length 931; 

Best Local Similarity 58.6%; Pred. No. 2.1e-31; 

Matches 58; Conservative 23; Mismatches 18; Indels 0; Gaps 0; 

Qy 6 YGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTLLSPIV 65 

: I I I I III I : I I I : I : I I I I I I I I : I : : I I : I : I : I : I :: I I : I I I I : I : I 
Db 533 FGTFNSLGGHLIIPNSGVSLLIPAGAIPQGRVYEMYVTVHRKENMRPPMEDSQTLLTPW 592 



Qy 66 SCGPPGVLLTRPVT LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I : II : I I : I : : I I I : : I 
Db 593 SCGP PGALLTRPVI LTLHHCADP STEDWKI QLKNQAVQG 631 



Search completed: March 1, 2005, 09:51:32 
Job time : 21.0163 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



March 1, 2005, 08:41:47 ; Search time 19.025 Seconds 

(without alignments) 
2799.282 Million cell updates/sec 

US-10-624-932-2_COPY_495_598 
559 

1 TSNMTYGTFNFLGGRLMIPN CGEPSPDSWSLRLKKQSCEG 104 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 1612378 seqs, 512079187 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1612378 



Database 



UniProt_03:* 
1: uniprot__sprot : * 
2: uniprot_trembl : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


559 


100. 


0 


842 


1 


UN5A_HUMAN 


Q6zn44 


homo sapien 


2 


554 


99. 


1 


898 


1 


UN 5 A MOUSE 


Q8kls4 


mus musculu 


3 


553 


98. 


9 


898 


1 


UN 5 A RAT 


008721 


rattus norv 


4 


348 


62. 


3 


931 


1 


UN5C MOUSE 


008747 


mus musculu 


5 


346 


61. 


9 


931 


1 


UN5C_CHICK 


Q7t2z5 


gallus gall 


6 


345 


61. 


7 


931 


1 


UN5C RAT 


Q761x5 


rattus norv 


7 


337 


60. 


3 


931 


1 


UN5C HUMAN 


095185 


homo sapien 


8 


309 


55. 


3 


943 


1 


UN5B XENLA 


Q8jgt4 


xenopus lae 


9 


287 


51. 


3 


945 


1 


UN5B HUMAN 


Q8izjl 


homo sapien 


10 


277 


49. 


6 


945 


1 


UN5B_RAT 


008722 


rattus norv 


11 


274 


49. 


0 


945 


1 


UN5B_M0USE 


Q8kls3 


mus musculu 


12 


245.5 


43. 


9 


956 


1 


UN5D MOUSE 


Q8kls2 


mus musculu 


13 


243.5 


43. 


6 


953 


1 


UN5D_HUMAN 


Q6uxz4 


homo sapien 


14 


192 


34. 


3 


1267 


2 


Q68DX9 


Q68dx9 


homo sapien 


15 


192 


34. 


3 


1769 


2 


097758 


097758 


canis famil 



16 


191 


34. 


2 


757 


2 


Q91YW2 


Q91yw2 


mus musculu 


17 


184 


32. 


9 


1692 


2 


Q6MZU1 


Q6mzul 


hoino sapien 


18 


163. 5 


29. 


2 


1736 


1 


Z01 HUMAN 


Q07157 


homo sapien 


19 


162 . 5 


29.1 


1745 


1 


ZOl MOUSE 


P39447 


mus musculu 


20 


155.5 


27 . 


8 


2134 


2 


Q7QAT6 


Q7qat6 


anopheles g 


21 


145 . 5 


26. 


0 


518 


2 


Q8IV45 


Q8iv45 


homo sapien 


22 


142 . 5 


25 . 


5 


1695 


2 


Q9BKL2 


Q9bkl2 


hydra atten 


23 


139 . 5 


25. 


0 


2090 


2 


09VHK3 


09vhk3 


d rosoDhila 


24 


135 . 5 


24 . 


2 


518 


2 


06R653 


06r653 


mus musculu 


25 


133 . 5 


23. 


9 


876 


2 


Q7 PW7 8 


O7ow78 


anopheles g 


26 


116 


20. 


8 


149 


2 


07YS63 


07vs63 


ens scrofa 


27 


101. 5 


18. 


2 


759 


2 


Q7PW77 


Q7pw77 


anopheles g 


28 


98.5 


17 . 


6 


1109 


2 


O8I103 


O8H03 


caenorhabdi 


29 


96 


17 . 


2 


919 


1 


UNC5 CAE EL 


Q26261 


caenorhabdi 


30 


86 


15. 


4 


1072 


1 


UNC5 DROME 


095tu8 


d to s onh i 1 a 


31 


80 . 5 


14 . 


4 


370 


2 


Q72 J31 


Q72 j31 


thermus the 


32 


78 


14 . 


0 


110 


2 


080N4 6 


08an4 6 


cowdox viru 


33 


76.5 


13 . 


7 


715 


2 


Q8IZW8 


Q8izw8 


homo sapien 


34 


76 


13 . 


6 


230 


2 


06Z007 


06zaa7 


homo saDien 


35 


76 






258 


2 


07WZ45 

s»/ / •» *J ^ <»</ 


07wz4 5 




36 


76 


13 . 




346 


1 


HTOM CHTCK 


092056 


rra "1 1 u s oa 1 "L 


37 


74 . 5 


13. 


3 


374 


2 








38 


74 


13. 


2 


121 


2 


08 JTG1 


08i tal 


rnwnox viru 




74 


13. 


2 


1 ^ 








uiy tiCi o ci ( — l v 


40 


73 . 5 


13. 


1 


1330 


2 




Q7r3t6 


rri P5 Tdi 3 1 am 

V-^ -L C-l J_ _1_ X CliU 


41 


73 


13. 


1 


1098 


2 


Q61304 


Q61304 


mus musculu 


42 


73 


13. 


1 


1136 


2 


Q9N180 


Q9nl80 


bos taurus 


43 


73 


13. 


1 


1848 


2 


Q61302 


Q61302 


mus musculu 


44 


73 


13. 


1 


1862 


1 


ANK1 MOUSE 


Q02357 


mus musculu 


45 


72.5 


13. 


0 


120 


2 


Q90XH5 


Q90xh5 


coturnix co 



ALIGNMENTS 



RESULT 1 
UN 5 A_HUMAN 

ID UN5A_HUMAN STANDARD; PRT; 842 AA. 

AC Q6ZN44; Q8TF26; Q96GP4; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT • 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5A precursor (Unc-5 homolog A) (Unc-5 homolog 1) . 
GN Name=UNC5A; Synonyms=KIAA1976, UNC5H1; 
OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N . A. (IS0F0RM2). 
RC TISSUE=Brain; 

RX PubMed=14702039; DOI=10 . 1038/ngl285; 

RA Ota T. , Suzuki Y . , Nishikawa T. f Otsuki T... , Sugiyama T.. f Irie R., 
RA Wakamatsu A. , Hayashi K., Sato H-, Nagai K., Kimura K. , Makita H. f 
RA Sekine M. , Obayashi M. , Nishi T., Shibahara T., Tanaka T., Ishii S. f 
RA Yamamoto J. -I., Saito K., Kawai Y., Isono Y. f Nakamura Y., 
RA Nagahari K. , Murakami K., Yasuda T., Iwayanagi T. , Wagatsuma M. , 



RA Shiratori A., Sudo H., Hosoiri T., Kaku Y. , Kodaira H., Kondo H., 

RA Sugawara M. , Takahashi M. , Kanda K. , Yokoi T., Furuya T., Kikkawa E., 

RA Omura Y., Abe K., Kamihara K. , Katsuta N., Sato K., Tanikawa M. , 

RA Yamazaki M. , Ninomiya K. , Ishibashi T., Yamashita H., Murakawa K. , 

RA Fujimori K., Tanai H., Kimata M. , Watanabe M. , Hiraoka S., Chiba Y., 

RA Ishida S., Ono Y. , Takiguchi S., Watanabe S., Yosida M. , Hotuta T., 

RA Kusano J., Kanehori K., Takahashi-Fu jii A., Hara H., Tanase T.-O., 

RA Nomura Y., Togiya S., Komai F. , Hara R. , Takeuchi K., Arita M. , 

RA Imose N . , Musashino K. , Yuuki H., Oshima A., Sasaki N., Aotsuka S., 

RA Yoshikawa Y., Matsunawa H. , Ichihara T., Shiohata N., Sano S., 

RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., 

RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F. , Wakebe H., 

RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M. , Kawakami B., 

RA Yamazaki M. , Watanabe K. , Kumagai A., Itakura S., Fukuzumi Y., 

RA Fujimori Y., Komiyama M. , Tashiro H. , Tanigami A., Fujiwara T., 

RA Ono T . , Yamada K. , Fujii Y., Ozaki K. , Hirao M. , Ohmori Y., 

RA Kawabata A. , Hikiji T., Kobatake N. , Inagaki H., Ikema Y., Okamoto S., 

RA Okitani R. , Kawakami T., Noguchi S., Itoh T., Shigeta K. , Senba T., 

RA Matsumura K., Nakajima Y., Mizurio T., Morinaga M. , Sasaki M. , 

RA Togashi T., Oyama M. , Hata H. , Watanabe M., Komatsu T., 

RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y. , Nakagawa K., 

RA Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R. , 

RA Nakai K. , Yada T., Nakamura Y., Ohara 0., Isogai T., Sugano S.; 

RT "Complete sequencing and characterization of 21,243 full-length human 

RT cDNAs . " ; 

RL Nat. Genet. 36:40-45(2004). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 3), AND SEQUENCE OF 302-842 FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas . 242603899; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J. , Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T . E. , 

RA Browns tein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J. , Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J. , Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E. f 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [3] 

RP SEQUENCE OF 624-728 FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=21842142; PubMed=11853319 ; 

RA Nagase T., Kikuno R., Ohara O. ; 

RT "Prediction of the coding sequences of unidentified human genes. XXII. 

RT The complete sequences of 50 new cDNA clones which code for large 



RT proteins."; 

RL DNA Res. 8:319-327(2001). 

RN [4] 

RP INDUCTION. 

RX PubMed=12598906; DOI=10 . 1038/ncb943; 

RA Tanikawa C, Matsuda K., Fukuda S., Nakamura Y., Arakawa H.; 

RT "p53RDLl regulates of p53-dependent apoptosis . " ; 

RL Nat. Cell Biol. 5:216-223(2003). 

RN [5] 

RP DOWN-REGULATION IN CANCER. 

RX PubMed=12655055; DOI=10 . 1073/pnas . 0738063100 ; 

RA Thiebault K. , Mazelin L. , Pays L., Llambi F. , Joly M.-O., 

RA Scoazec J.-Y., Saurin J.-C, Romeo G., Mehlen P.; 

RT "The netrin-1 receptors UNC5H are putative tumor suppressors 

RT controlling cell death commitment."; 

RL Proc. Natl. Acad. Sci. U.S.A. 100:4173-4178(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand. 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. Interacts 
CC with MAGED1. Interacts with PRKCABP, possibly mediating some 

CC interaction with PKC (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. The interaction 
CC with PRKCABP regulates its surface expression and leads to its 

CC removal from surface of neurons and growth cones (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=3; 

CC Name=l; 

CC IsoId=Q6ZN44-l; Sequence=Displayed; 

CC Note=No experimental confirmation available; 

CC Name=2 ; 

CC IsoId=Q6ZN44-2; Sequence=VSP_011694 , VSP_011695; 

CC Note=No experimental confirmation available; 

CC Name=3 ; 

CC IsoId=Q6ZN44-3 ; Sequence=VSP_011693 ; 

CC Note=No experimental confirmation available; 

CC -!- INDUCTION: By p53/TP53. 

CC -!- DOMAIN: The ZU5 domain mediates the interaction with MAGED1 , which 

CC participates in the induction of apoptosis (By similarity) . 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues. 

CC Phosphorylated by PKC in vitro (By similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 

CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis (By similarity) . 

CC -!- MISCELLANEOUS: Down-regulated in multiple cancers including 
CC colorectal , breast , ovary, uterus, stomach, lung, or kidney 

CC cancers . 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 1 TSP type-1 domain. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 



cc 


-!- CAUTION: Ref.3 


sequence 


differs from that shown due to the 


cc 


presence of introns . 






cc 
cc 












This SWISS-PROT entry is 


copyright. It is produced through a collaboration 


cc 


between 


the Swiss 


Institute of Bioinf ormatics and the EMBL outstation - 


cc 


the European Bioinf ormatics 


Institute. There are no restrictions on its 


cc 


use by 


non-profit institutions as long as its content is in no way 


cc 


modified 


and this 


statement 


is not removed. Usage by and for commercial 


cc 


entities 


requires 


a license 


agreement (See http://www.isb-sib.ch/announce/ 


cc 
cc 

DR 


or send an email to license@isb-sib . ch) . 


EMBL; AK131380; BAD18531. 


1; 




DR 


EMBL; BC009333; AAH09333. 


2; 




DR 


EMBL; BC033727; -; 


NOT ANNOTATED CDS. 


DR 


EMBL; AB075856; BAB85562. 


l; 


ALT_SEQ. 


DR 


Genew; HGNC: 12567; 


UNC5A. 






DR 


MIM; 607869; 








DR 


InterPro; 


IPR003599; Ig. 






DR 


InterPro; 


IPR007110; Ig-like. 


DR 


InterPro; 


IPR000884; TSP1 






DR 


Pfam; PF00047; ig; 


1. 






DR 


Pfam; PF00090; TSP 


l; l. 






DR 


SMART; SM00409; IG 


; l. 






DR 


SMART; SM00209; TSP1; 1. 






DR 


PROSITE; 


PS50017; 


DEATH DOMAIN; FALSE NEG. 


DR 


PROSITE; 


PS50835; 


IG_LIKE 


; 


1. 


DR 


PROSITE; 


PS50092; 


TSP1; 1 






KW 


Alternative splicing; Apoptosis; Developmental protein; 


KW 


Immunoglobulin domain; Phosphorylation; Receptor; Signal; 


KW 


Transmembrane . 








FT 


SIGNAL 


1 


25 




Potential . 


FT 


CHAIN 


26 


842 




Netrin receptor UNC5A. 


FT 


DOMAIN 


1 


306 




Extracellular (Potential) . 


FT 


TRANS MEM 


307 


327 




Potential . 


FT 


DOMAIN 


328 


842 




Cytoplasmic (Potential) . 


FT 


DOMAIN 


44 


141 




Ig-like. 


FT 


DOMAIN 


155 


234 




Ig-like C2-type. 


FT 


DOMAIN 


242 


294 




TSP type-1. 


FT 


DOMAIN 


439 


542 




ZU5. 


FT 


DOMAIN 


761 


841 




Death. 


FT 


SITE 


340 


341 




Cleavage (by caspase-3) (By similarity) . 


FT 


SITE 


605 


623 




Interaction with DCC (By similarity) . 


FT 


DISULFID 


65 


124 




By similarity. 


FT 


DISULFID 


170 


221 




By similarity. 


FT 


CARBOHYD 


107 


107 




N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


218 


218 




N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


287 


287 




N-linked (GlcNAc. . .) (Potential). 


FT 


VARSPLIC 


1 


97 




MAVRP GLW PALLG I VLAAWL RG S GAQQS AT VAN P VP GAN P D 


FT 










LLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWV 


FT 










RQVDHVI ERSTDGSN -> MAGTSERSLISSISQPKAIECF 


FT 










EVKKKAFLTHGRYHGSGATPPKTKDPKPETFCGQT ( in 


FT 










isoform 3) . 


FT 










/FTId=VSP_011693. 


FT 


VARSPLIC 


296 


301 




TASGPE -> SESSLP (in isoform 2). 


FT 










/FTId=VSP_011694. 


FT 


VARSPLIC 


302 


842 




Missing (in isoform 2) . 


FT 










/FTId=VSP 011695. 



SQ SEQUENCE 842 AA; 



92958 MW; 3DFADCF973131849 CRC64; 



Query Match 100.0%; Score 559; DB 1; Length 842; 

Best Local Similarity 100.0%; Pred. No. 7.7e-52; 

Matches 104; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TSNMT YGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 439 TSNMTYGTFNFLGGRMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 498 

Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 499 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 542 



RESULT 2 
UN5A_MOUSE 

ID UN5A_MOUSE STANDARD; PRT; 898 AA. 

AC Q8K1S4; Q6PEF7; Q80T71; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5A precursor (Unc-5 homolog A) (Unc-5 homolog 1) . 

GN Name-Unc5a; Synonyms=Kiaal976, UncShl; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBIJTaxID=10090 ; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1), AND TISSUE SPECIFICITY. 

RX MEDLINE=22239710; PubMed=1235118 6; DOI=10 . 1016/S0925-4773 ( 02 ) 00248-4 ; 

RA Engelkamp D.; 

RT "Cloning of three mouse unc-5 genes and their expression patterns at 

RT mid-gestation . " ; 

RL Mech. Dev. 118:191-197(2002). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 

RC TISSUE=Brain; 

RX MEDLINE=22579291; PubMed=12693553; 

RA Okazaki N., Kikuno R. , Ohara R. , Inamoto S., Aizawa H., Yuasa S., 

RA Nakajima D. , Nagase T., Ohara O., Koga H.; 

RT "Prediction of the coding sequences of mouse homologues of KIAA gene: 

RT II. The complete nucleotide sequences of 400 mouse KIAA-homologous 

RT cDNAs identified by screening of terminal sequences of cDNA clones 

RT randomly sampled from size-fractionated libraries."; 

RL DNA Res. 10:35-48(2003). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC STRAIN=C57BL/6; TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas. 242603899 ; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D.; 

RA Aitschul S.F., Zeeberg B. , Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K., Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Browns tein M.J., Usdin T . B. , Toshiyuki S., Carninci P., Prange C, 



RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C. , Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A. , Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchinan J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand (By 

CC similarity) . 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. Interacts 
CC with MAGED1. Interacts with PRKCABP, possibly mediating some 

CC interaction with PKC (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. The interaction 
CC with PRKCABP regulates its surface expression and leads to its 

CC removal from surface of neurons and growth cones (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=3; 

CC Name=l; 

CC IsoId=Q8KlS4-l; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=Q8KlS4-2; Sequence=VSP_011697 ; 

CC Note=No experimental confirmation available; 

CC Name=3; 

CC IsoId=Q8KlS4-3; Sequence=VSP_011696; 

CC Note=No experimental confirmation available; 

CC -!- TISSUE SPECIFICITY: Restricted to central nervous system. 

CC -!- DOMAIN: The ZU5 domain mediates the interaction with MAGED1, which 

CC participates in the induction of apoptosis (By similarity) . 

CC -!- PTM: Phosphorylated by PKC in vitro. Phosphorylated on cytoplasmic 

CC tyrosine residues (By similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 

CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis (By similarity) . 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 



CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 
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entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; AJ487852; CAD32250.1; -. 
EMBL; AK122575; BAC65857.1; ALT_INIT. 
EMBL; BC058084; AAH58084.1; -. 
HSSP; P07996; 1LSL. 

894682; UncSa. 



MGD; MGI: 
InterPro; IPR000488; Death. 
InterPro; IPR011029; DEATH_like. 
InterPro; IPR003599; Ig. 
InterPro; IPR007110; Ig-like. 
InterPro; IPR000884; TSP1. 
InterPro; IPR008085; TSP_1. 
InterPro; IPR000906; ZU5 . 
Pfam; PF00531; Death; 1. 
Pfam; PF00047; ig; 1. 
Pfam; PF00090; TSP_1; 2. 
Pfam; PF00791; ZU5; 1. 
PRINTS; PR01705; TSP1REPEAT. 
SMART; SM00005; DEATH; 1. 
SMART; SM00409; IG; 1. 
SMART; SM00209; TSP1; 2. 
SMART; SM00218; ZU5 ; 1. 

PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG. 
PROSITE; PS50835; IG_LIKE; 1. 
PROSITE; PS50092; TSP1; 2. 

Alternative splicing; Apoptbsis; Developmental protein; 
Immunoglobulin domain; Phosphorylation; Receptor; Repeat; Signal; 



KW 


Transmembrane . 






FT 


SIGNAL 


1 


25 


Potential. 


FT 


CHAIN 


26 


898 


Netrin receptor UNC5A. 


FT 


DOMAIN 


26 


361 


Extracellular (Potential) . 


FT 


TRANS MEM 


362 


382 


Potential . 


FT 


DOMAIN 


383 


898 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


44 


141 


Ig-like. 


FT 


DOMAIN 


155 


234 


Ig-like C2-type. 


FT 


DOMAIN 


242 


296 


TSP type-1 1. 


FT 


DOMAIN 


298 


350 


TSP type-1 2. 


FT 


DOMAIN 


495 


598 


ZU5. 


FT 


DOMAIN 


817 


897 


Death. 


FT 


SITE 


396 


397 


Cleavage (by caspase-3) (By similarity) 


FT 


SITE 


661 


679 


Interaction with DCC (By similarity) . 


FT 


DISULFID 


65 


124 


By similarity. 


FT 


DISULFID 


170 


221 


By similarity. 


FT 


CARBOHYD 


107 


107 


N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


218 


218 


N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


343 


343 


N-linked (GlcNAc. . .) (Potential). 


FT 


VARSPLIC 


1 


790 


Missing (in isoform 3) . 


FT 








/FTId=VSP_0ll696. 


FT 


VARSPLIC 


241 


296 


Missing (in isoform 2) . 


FT 








/FTId=VSP 011697. 


FT 


CONFLICT 


217 


217 


A ->■ P (in Ref . 3) . 


SQ 


SEQUENCE 


898 AA; 


98856 


MW; 59F04BA2E196C1DB CRC64; 



Query Match 

Best Local Similarity 



99.1%; 
99.0%; 



Score 554; DB 1; Length 898; 
Pred. No. 2.9e-51; 



Matches 103; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 495 TSNMAYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 

Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 555 LSPIVSCGPPGVXLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 3 
UN5A_RAT 

ID UN5A_RAT STANDARD; PRT; 898 AA. 

AC 008721; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5A precursor (Unc-5 homolog A) (Unc-5 homolog 1) . 

GN Name=Unc5a; Synonyms=Unc5hl ; 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A., FUNCTION, SUBCELLULAR LOCATION, AND TISSUE 

RP SPECIFICITY. 

RC TISSUE=Ventral spinal cord; 

RX MEDLINE=97271897; PubMed=9126742 ; 

RA Leonardo E.D., Hinck L. , Masu M. , Keino-Masu K., Ackerman S.L., 

RA Tessier-Lavigne M. ; 

RT "Vertebrate homologues of C. elegans UNC-5 are candidate netrin 

RT receptors."; 

RL Nature 386:833-838(1997). 

RN [2] 

RP FUNCTION, AND INTERACTION WITH DCC. 

RX PubMed=10399920; 

RA Hong K. , Hinck L., Nishiyama M. , Poo M.-M., Tessier-Lavigne M. , 

RA Stein E. ; 

RT "A ligand-gated association between cytoplasmic domains of UNC5 and 

RT DCC family receptors converts netrin-induced growth cone attraction to 

RT repulsion."; 

RL Cell 97:927-941(1999). 

RN [3] 

RP TISSUE SPECIFICITY. 

RX PubMed=11472849; 

RA Barrett C, Guthrie S.; 

RT "Expression patterns of the netrin receptor UNC5H1 among developing 

RT motor neurons in the embryonic rat hindbrain."; 

RL Mech. Dev. 106:163-166(2001). 

RN [4] 

RP FUNCTION. 

RX PubMed=11387206; DOI-10. 1093/embo j/20 . 11 . 2715; 

RA Llambi F., Causeret F. , Bloch-Gallego E., Mehlen P.; 

RT "Netrin-1 acts as a survival factor via its receptors UNC5H and DCC"; 

RL EMBO J. 20:2715-2722(2001). 

RN [5] 



RP FUNCTION, SUBCELLULAR LOCATION, AND INTERACTION WITH MAGED1 . 

RX PubMed=12598531; DOI=10 . 1074/ jbc .M300415200 ; 

RA Williams M.E., Strickland P., Watanabe K., Hinck L. ; 

RT "UNC5H1 induces apoptosis via its juxtamembrane region through an 

RT interaction with NRAGE . " ; 

RL J. Biol. Chem. 278:17483-17490(2003). 

RN [6] 

RP INTERACTION WITH PRKCABP, PHOSPHORYLATION, AND MUTAGENESIS OF 

RP 896-ALA— CYS-898 . 

RX PubMed=14672991; DOI=23/36/11279 ; 

RA Williams M.E., Wu S.C.-Y., McKenna W.L., Hinck L. ; 

RT "Surface expression of the netrin receptor UNC5H1 is regulated through 

RT a protein kinase C-interacting protein/protein kinase-dependent 

RT mechanism."; 

RL J. Neurosci. 23:11279-11288(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand. 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. Interacts 
CC with MAGED1. Interacts with PRKCABP, possibly mediating some 

CC interaction with PKC. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. The interaction 
CC with PRKCABP regulates its surface expression and leads to its 

CC . removal from surface of neurons and growth cones . 

CC TISSUE SPECIFICITY: Mainly expressed in regions of differentiating 

CC neurons. Expressed at early stages of neural tube development in 

CC the ventral spinal cord. In developing hindbrain, it colocalizes 

CC with a number of cranial motor neuron subpopulations from 

CC embryonic Ell to E14, while DCC is expressed by motor neurons at 

CC E12 . Also expressed in non-neural structures, such as the basal 

CC plane of the hindbrain and midbrain, in the developing 

CC hypothalamus, thalamus and in the pallidium. 

CC -!- DOMAIN: The ZU5 domain mediates the interaction with MAGED1, which 

CC participates in the induction of apoptosis. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 

CC similarity) . Phosphorylated by PKC in vitro. 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 
CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis. 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 



cc 

DR EMBL; U87305; AAB57678.1; -. 

DR HSSP; P07996; 1LSL. 

DR RGD; 621755; Unc5hl. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5 . 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TSP1REPEAT. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00209; TSP1; 2. 

DR SMART; SM00218; ZU5; 1. 

DR PROSITE; PS50017; DEATH_DOMAIN ; FALSEJtfEG. 

DR PROSITE; PS50835; IG_LIKE; 1. 

DR PROSITE; PS50092; TSP1; 2. 

KW Apoptosis; Developmental protein; Immunoglobulin domain; 

KW Phosphorylation; Receptor; Repeat; Signal; Transmembrane. 



FT 


SIGNAL 


1 


25 


Potential . 


FT 


CHAIN 


26 


898 


Netrin receptor UNC5A. 


FT 


DOMAIN 


26 


361 


Extracellular (Potential) . 


FT 


TRANS MEM 


362 


382 


Potential . 


FT 


DOMAIN 


383 


898 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


44 


141 


Ig-like . 


FT 


DOMAIN 


155 


238 


Ig-like C2-type. 


FT 


DOMAIN 


242 


296 


TSP type-1 1. 


FT 


DOMAIN 


298 


350 


TSP type-1 2. 


FT 


DOMAIN 


495 


598 


ZU5. 


FT 


DOMAIN 


817 


897 


Death. 


FT 


SITE 


396 


397 


Cleavage (by caspase-3) (By similarity) . 


FT 


SITE 


661 


679 


Interaction with DCC (By similarity) . 


FT 


DISULFID 


65 


124 


By- similarity. 


FT 


DISULFID 


170 


221 


By similarity. 


FT 


CARBOHYD 


107 


107 


N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


218 


218 


N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


343 


343 


N-linked (GlcNAc. . .) (Potential). 


FT 


MUTAGEN 


896 


898 


Missing: Abolishes interaction with 


FT 








PRKCABP . 


SQ 


SEQUENCE 


898 AA; 


98840 


MW; 7A3CBCB9E7ACA135 CRC64; 



Query Match 98.9%; Score 553; DB 1; Length 898; 

Best Local Similarity 98.1%; Pred. No. 3.8e-51; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

Nil I I I I I I I I I I I III III II I III III I I I I III II I I I I I III III I I I I I I I I I 

Db 495 TSNMAYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 554 



Qy 



61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 
I II : I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 



Db 555 LSPWSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 598 



RESULT 4 
UN5C_M0USE 

ID UN5C_M0USE STANDARD; PRT; 931 AA. 

AC 008747; Q8CD16; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5C precursor (Unc-5 homolog C) (Unc-5 homolog 3) 

DE (Rostral cerebellar malformation protein) . 

GN Name=Unc5c; Synonyms =Rcm, Unc5h3; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBIJTaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1), FUNCTION, DISEASE,- AND TISSUE 

RP SPECIFICITY. 

RC STRAIN=C57B6/SJL; 

RX MEDLINE= 97271898; PubMed= 9 126743; 

RA Ackerman S.L., Kozak L.P., Przyborski S.A. , Rund L.A. , Boyer B.B., 

RA Knowles B.B.; 

RT "The mouse rostral cerebellar malformation gene encodes an UNC-5-lik 

RT protein."; 

RL Nature 386:838-842(1997). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 2) . 

RC STRAIN=C57BL/6J; TISSUE=Testis ; 

RX MEDLINE=22354683; PubMed=124668 51; DOI=10 . 1038/nature01266; 

RA Okazaki Y. , Furuno M. , Kasukawa T., Adachi J. , Bono H., Kondo S., 

RA Nikaido I., Osato N., Saito R. , Suzuki H., Yamanaka I., Kiyosawa H. , 

RA Yagi K., Tomaru Y., Hasegawa Y., Nogami A., Schonbach C, Gojobori T 

RA Baldarelli R. , Hill D.P., Bult C, Hume D.A., Quackenbush J., 

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W., 

RA Blake J. A., Bradt D., Brusic V., Chothia C, Corbani L.E., Cousins S 

RA Dalla E. , Dragani T.A. , Fletcher C.F., Forrest A., Frazer K.S., 

RA Gaasterland T., Gariboldi M. , Gissi C, Godzik A. , Gough J., 

RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D., 

RA Kanai A., Kawaji H., Kawasawa Y. , Kedzierski R.M., King B.L., 

RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P. A., 

RA Maglott D.R., Maltais L., Marchionni L., McKenzie L., Miki H., 

RA Nagashima T., Numata K., Okido T., Pa van W.J., Pertea G., Pesole G. , 

RA Petrovsky N., Pillai R. , Pontius J.U., Qi D., Ramachandran S., 

RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M., 

RA Sandelin A., Schneider C, Semple C.A., Setou M. , Shimada K., 

RA Sultana R., Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M. , 

RA Verardo R., Wagner L., Wahlestedt C, Wang Y., Watanabe Y., Wells C. 

RA Wilming L.G.,' Wynshaw-Boris A. , Yanagisawa M. , Yang I., Yang L. , 

RA Yuan Z., Zavolan M. , Zhu Y., Zimmer A., Carninci P., Hayatsu N., 

RA Hirozane-Kishikawa T., Konno H. , Nakamura M. , Sakazume N., Sato K., 

RA Shiraki T., Waki K. , Kawai J., Aizawa K., Arakawa T., Fukuda S. , 

RA Hara A., Hashizume W., Imotani K., Ishii Y., Itoh M. , Kagawa I., 

RA Miyazaki A., Sakai K., Sasaki D. , Shibata K., Shinagawa A., 

RA Yasunishi A., Yoshino M. , Waterston R. , Lander E.S W Rogers J., 

RA Birney E., Hayashizaki Y. ; 



RT "Analysis of the mouse trans criptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

RN [3] 

RP FUNCTION, AND TISSUE SPECIFICITY. 

RX PubMed=9389662; 

RA Przyborski S.A., Knowles B.B., Ackerman S.L.; 

RT "Embryonic phenotype of Unc5h3 mutant mice suggests chemorepulsion 

RT during the formation of the rostral cerebellar boundary."; 

RL Development 125:41-50(1998). 

RN [4] 

RP INTERACTION WITH DCC. 

RX PubMed=10399920; 

RA Hong K., Hinck L., Nishiyama M. , Poo M.-M., Tessier-Lavigne M. , 

RA Stein E. ; 

RT "A ligand-gated association between cytoplasmic domains of UNC5 and 

RT DCC family receptors converts netrin-induced growth cone attraction to 

RT repulsion."; 

RL Cell 97:927-941(1999). 

RN [5] 

RP PHOSPHORYLATION SITE TYR-568, AND MUTAGENESIS OF TYR-568. 

RX PubMed=11533026; DOI=10 . 1074/ jbc .M103872200 ; 

RA Tong J. , Killeen M. , Steven R. , Binns K.L., Culotti J., Pawson T.; 

RT "Netrin stimulates tyrosine phosphorylation of the UNC-5 family of 

RT netrin receptors and induces Shp2 binding to the RCM cytodomain. " ; 

RL J. Biol. Chem. 276:40917-40925(2001). 

RN [6] 

RP FUNCTION. 

RX PubMed=12451134; DOI=22/23/10346; 

RA Finger J.H., Bronson R. T . , Harris B., Johnson K. , Przyborski S.A. , 

RA Ackerman S.L.; 

RT "The netrin 1 receptors Unc5h3 and Dec are necessary at multiple 

RT choice points for the guidance of corticospinal tract axons."; 

RL J. Neurosci. 22:10346-10356(2002). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. Also involved in corticospinal tract axon guidances 

CC independently of DCC. It also acts as a dependence receptor 

CC required for apoptosis induction when not associated with netrin 

CC ligand. 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC - !- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; 

CC IsoId=O08747-l; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=O08747-2; Sequence=VSP_011702 ; 

CC -!- TISSUE SPECIFICITY: Mainly expressed in regions of differentiating 
CC neurons. Highly expressed in brain and lung. Weakly expressed in 

CC testis , ovary, spleen, thymus and bladder. Expressed at very low 

CC level in kidney, intestine and salivary gland. 

CC -!- PTM: Phosphorylated on different cytoplasmic tyrosine residues. 
CC Phosphorylation of Tyr-568 leads to an interaction with PTPN11 

CC phosphatase, suggesting that its activity is regulated by 
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phosphorylation/dephosphorylation. Tyrosine phosphorylation is 

netrin-dependent . 
-!- PTM: Proteolytically cleaved by caspases during apoptosis. The 

cleavage does not take place when the receptor is associated with 

netrin ligand. Its cleavage by caspases is required to induce 

apoptosis (By similarity) . 
-!- DISEASE: Defects in Unc5c are the cause of rostral cerebellar 

malformation (Rem) . Rem is characterized by cerebellar and 

midbrain defects, apparently as a result of abnormal neuronal 

migration. 

SIMILARITY: Belongs to the UNC-5 family. 
SIMILARITY: Contains 1 death domain. 

SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 
SIMILARITY: Contains 1 immunoglobulin-like domain. 
SIMILARITY: Contains 2 TSP type-1 domains. 
SIMILARITY: Contains 1 ZU5 domain. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; U72634; AAB54103.1; 

EMBL; AK031655; BAC27495.1; -. 

HSSP; P07996; 1LSL. 

MGD; MGI: 1095412; Unc5c. 

GO; GO: 0005886; C:plasma membrane; IC. 

F: netrin receptor activity; IDA. 
F: protein binding; IDA. 
P: brain development; IMP. 
P: regulation of cell migration; IMP. 
IPR000488; Death. 
I PRO 11 02 9; DEATH_like. 
IPR007110; Ig-like. 
IPR003598; Ig_c2. 
IPR000884; TSP1. 
IPR008085; TSP_1. 
IPR000906; ZU5 . 
Pfam; PF00531; Death; 1. 
Pfam; PF00047; ig; 1. 
Pfam; PF00090; TSP_1; 2. 
Pfam; PF00791; ZU5; 1. 
PRINTS; PR01705; TSP1REPEAT. 
SMART; SM00005; DEATH; 1. 
SMART; SM00408; IGc2; 1. 
SMART; SM00209; TSP1; 2. 
SMART; SM00218; ZU5; 1. 

PROSITE; PS50017; DEATHJDOMAIN ; FALSE_NEG. 
PROSITE; PS50835; IG_LIKE; 1. 
PROSITE; PS50092; TSP1; 2. 

Alternative splicing; Apoptosis; Developmental protein; 
Immunoglobulin domain; Phosphorylation; Receptor; Repeat; Signal; 
Transmembrane . 

SIGNAL 1 40 Potential. 



GO; 
GO; 
GO; 
GO; 



GO:0005042; 
GO:0005515; 
GO:0007420; 
GO: 0030334; 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
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leading to a increased level of 


FT 








phosphorylation . 


FT 


CONFLICT 


16 


16 


L -> I (in Ref . 2) . 

X_J ^ «!■ \ XX X X, • / • 


FT 


CONFLICT 


733 


733 


H -> R (in Ref. 2) . 


FT 


CONFLICT 


924 


924 


S -> Y (in Ref. 2) . 


SQ 


SEQUENCE 


931 AA; 


103062 


MW; 8A5D951A4EECA17 9 CRC64; 


Query Match 




62.3%; 


Score 348; DB 1; Length 931; 



Best Local Similarity 58.6%; Pred. No. 7.6e-29; 

Matches 58; Conservative 23; Mismatches 18; Indels 0; Gaps 0; 
Qy 6 YGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIV 65 



Db 533 FGTFNSLGGHLI I PNSGVSLLI PAGAI PQGRVTEMYVTVHRKENMRPPMEDSQTLLTPVV 592 



Qy 66 SCGPPGVLLTRPVI LAMDHCGEP S PDSWS LRLKKQS CEG 104 

I I I I I I I I I I I I I I : II : I I : I :: I I I : : I 
Db 593 S CGP P GALLT RP VI LT LHHCAD P S T E DWK I Q LKNQAVQ G 631 



RESULT 5 
UN5C_CHICK 

ID UN5C_CHICK STANDARD; PRT; 931 AA. 

AC Q7T2Z5; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5C precursor (Unc-5 homolog C) (Unc-5 homolog 3) 

DE (CUNC-5H3) . 

GN Name=UNC5C; 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Gallif ormes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 



RX 
RA 
RT 
RT 
RL 
CC 
CC 
CC 
CC 
CC 
CC 
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PubMed=127 99087; 
Guan W., Condic M.L.; 

"Characterization of Netrin-1, Neogenin and CUNC-5H3 expression during 
chick dorsal root ganglia development."; 
Gene Expr. Patterns 3 : 369-373 (2003 ) . 

-!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
axon repulsion of neuronal growth cones in the developing nervous 
system upon ligand binding (By similarity) . 

SUBCELLULAR LOCATION: Type I membrane protein (By similarity) . 
TISSUE SPECIFICITY: Restricted to proprioceptive neurons. 
PTM: Phosphorylated on cytoplasmic tyrosine residues (By 
similarity) . 

SIMILARITY: Belongs to the UNC-5 family. 
SIMILARITY: Contains 1 death domain. 

SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 
SIMILARITY: Contains 1 immunoglobulin-like domain. 
Contains 2 TSP type-1 domains. 
1 ZU5 domain. 



SIMILARITY: 
SIMILARITY: 



Contains 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinformatics and the EMBL outstation - 
the European Bioinformatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib. ch) . 

EMBL; AY187310; AA067275.1; -. 
InterPro; IPR000488; Death. 
InterPro; IPR007110; Ig-like. 
InterPro; IPR003598; Ig_c2 . 
InterPro; IPR000884; TSP1. 
InterPro; IPR000906; ZU5 . 
Pfam; PF00531; Death; 1. 
Pfam; PF00047; ig; 1. 
Pfam; PF00090; TSP_1; 2. 
Pfam; PF00791; ZU5; 1. 
SMART; SM00005; DEATH; 1. 
SMART; SM00408; IGc2; 1. 
SMART; SM00209; TSP1; 2. 
SMART; SM00218; ZU5; 1. 

PROSITE; PS50017; DEATH_DOMAIN ; FALSE_NEG. 
PROSITE; PS50835; IGJLIKE; 1. 
PROSITE; PS50092; TSP1; 2. 

Developmental protein; Immunoglobulin domain; Phosphorylation; 



KW 


Receptor; 


Repeat; 


Signal; 


Transmembrane . 


FT 


SIGNAL 


1 


39 


Potential . 


FT 


CHAIN 


40 


931 


Netrin receptor UNC5C. 


FT 


DOMAIN 


40 


380 


Extracellular (Potential) . 


FT 


TRANSMEM 


381 


401 


Potential . 


FT 


DOMAIN 


402 


931 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


62 


159 


Ig-like. 


FT 


DOMAIN 


161 


256 


Ig-like C2-type. 


FT 


DOMAIN 


260 


314 


TSP type-1 1. 


FT 


DOMAIN 


316 


368 


TSP type-1 2. 


FT 


DOMAIN 


528 


631 


ZU5. 


FT 


DOMAIN 


850 


929 


Death. 



FT DISULFID 83 142 By similarity. 

FT DISULFID 188 239 By similarity. 

FT CARBOHYD 236 236 N-linked (GlcNAc. . .) (Potential). 

FT CARBOHYD 361 361 N-linked (GlcNAc. . .) (Potential). 

SQ SEQUENCE 931 AA; 102906 MW; 1E23A0D84F2E2C62 CRC64; 

Query Match 61.9%; Score 346; DB 1; Length 931; 
Best Local Similarity 58.6%; Pred. No. 1.3e-28; 

Matches 58; Conservative 22; Mismatches 19; Indels 0; Gaps 0; 

Qy 6 YGTFNFLGGRLMI PNTGI SLLIPPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTLLSPIV 65 

rllll III I : I I I : I : I I I I I I : I : I : : I I : I : I : I : I :| |: llll:|:| 

Db 533 FGTFNSLGGHLVIPNSGVSLLIPAGAVPQGRVYEMYVTVHRKEGMRPPVEDSQTLLTPW 592 

Qy 66 SCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I : I I II II: I I :: I I I : : I 

Db 593 SCGPPGALLTRPWLTMHHCAEPNMDDWQIQLKHQAGQG 631 

RESULT 6 
UN5C_RAT 

ID UN5C_RAT STANDARD; PRT; 931 AA. 

AC Q761X5; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5C precursor (Unc-5 homolog C) (Unc-5 homolog 3) . 

GN Name=Unc5c; Synonyms=Unc5h3; 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. , AND DISEASE. 

RX PubMed=15010202 ; DOI=10 . 1016/ j .molbrainres .2003 . 12 . 003; 

RA Kuramoto T., Kuwamura M. , Serikawa T.; 

RT "Rat neurological mutations cerebellar vermis defect and hobble are 

RT caused by mutations in the netrin-1 receptor gene Unc5h3."; 

RL Brain Res. Mol. Brain Res. 122:103-108(2004). 

RN [2] 

RP FUNCTION. 

RX PubMed=11387206; DOI=10 . 1093/emboj/20 . 11 . 2715; 

RA Llambi F. , Causeret F. , Bloch-Gallego E., Mehlen P.; 

RT "Netrin-1 acts as a survival factor via its receptors UNC5H and DCC." ; 

RL EMBO J. 20:2715-2722(2001). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 

CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. Also involved in corticospinal tract axon guidances 

CC independently of DCC. It also acts as a dependence receptor 

CC required for apoptosis induction when not associated with netrin 

CC ligand. 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By 

CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I .membrane protein (By similarity). 

CC -!- TISSUE SPECIFICITY: Mainly expressed in brain. Also expressed in 



CC kidney. Not expressed in developing or adult lung. 

CC -!- PTM: Phosphorylated on different cytoplasmic tyrosine residues. 

CC Phosphorylation of Tyr-568 leads to an interaction with PTPN11 

CC phosphatase, suggesting that its activity is regulated by 

CC phosphorylation/dephosphorylation. Tyrosine phosphorylation is 

CC netrin-dependent (By similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis . The 
CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis. 

CC -!- DISEASE: Defects in Unc5c are the cause of cerebellar vermis 

CC defect (cvd) and hobble (hob) phenotypes . Cvd and hob rats exhibit 

CC cerebellar and midbrain defects, possibly as a result of abnormal 

CC neuronal migration, and exhibit laminar structure abnormalities in 

CC the fused cerebellar hemispheres and ectopic cerebellar tissues in 

CC the cerebello-pontine junction. 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AB118026; BAD05181.1; -. 

DR RGD; 735109; UncSc. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2 . 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5 . 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TSP1REPEAT. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM004 09; IG; 1. 

DR SMART; SM004 08; IGc2; 1. 

DR SMART; SM00209; TSP1; 2. 

DR SMART; SM00218; ZU5; 1. 

DR 1 PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG. 

DR PROSITE; PS50835; IG_LIKE; 1. 

DR PROSITE; PS50092; TSP1; 2. 

KW Apoptosis; Developmental protein; Immunoglobulin domain; 

KW Phosphorylation; Receptor; Repeat; Signal; Transmembrane. 

FT SIGNAL 1 40 Potential. 



IV' 1 1 


fUATM 


4 1 


-7 -J -L 




£ X 




4 1 
H J. 




PvfTarol 1 ill ar / Pr\i"Cint" "i a 1 \ 


FT 


TDZiMCJMFM 
i. rvrUN omCiTl 


^ft 1 
«3 0 J. 






FT 




a no 
4 uz 




LOpiaSIulC ^rOtcULlal/ . 


FT 


DUJYLA1JN 


£9 




Ig — like • 


FT 


HfWZlTM 


1D1 




J. y 1J.A.C ^z. Lype • 


FT 




9 fin 


>3 x *± 


i or Lypc ± j. • 


FT 


nnMBTM 
DUJYIAIIn 


^1 fi 


O DO 


loxr type i z. 


FT 


UUJYLAJ.1N 


top 
DZ O 


DjI 


7TT S 


FT 




pen 
0 3 U 




UcdLIl . 


FT 


bl lllj 




A 1 £ 
fi lb 


Cleavage (by caspase— 3) (By similarity) . 
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FT 


DISULFID 


188 


239 


By similarity. 


FT 


MOD_RES 


568 


568 


Phosphotyrosine (By similarity) . 


FT 


CARBOHYD 


236 


236 


N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


361 


361 


N-linked (GlcNAc. . .) (Potential). 


SQ 


SEQUENCE 


931 AA; 


103134 


MW; 25B183A97BCB8401 CRC64; 



Query Match 61.7%; Score 345; DB 1; Length 931; 

Best Local Similarity 57.6%; Pred. No. 1.6e-28; 

Matches 57; Conservative 24; Mismatches 18; Indels 0; Gaps 0; 

Qy 6 YGTTOFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIV 65 

: I I I I I I I I : I I I : I : I I I I I I I I : I : : I I : I : I : I : I : : I' I : I I I I : I : I 
Db 533 FGTFNSLGGHLI I PNSGVSLLI PAGAI PQGRVTEMYVTVHRKENMRPPMEDSQTLLTPVV 592 

Qy 66 SCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I I I : I I : I : : I :: I I I : : I 
Db 593 SCGPPGALLTRPVILTLHHCADPNTEDWKIQLKNQAVQG 631 



RESULT 7 
UN5C_HUMAN 

ID UN5C_HUMAN STANDARD; PRT; 931 AA. 

AC 095185; Q8IUT0; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5C precursor (Unc-5 homolog C) (Unc-5 homolog 3). 

GN Name=UNC5C; Synonyms=UNC5H3 ; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1), AND TISSUE SPECIFICITY. 

RC TISSUE=Brain; 

RX MEDLINE=99000841; PubMed=9782087 ; DOI=10 . 1006/geno . 1998 . 5425 ; 

RA Ackerman S.L., Knowles B.B.; 

RT "Cloning and mapping of the UNC5C gene to human chromosome 4q21-q23." ; 

RL Genomics 52:205-208(1998). 

RN [2] - 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC TISSUE=Lung; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas . 242603899 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 



RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., BhatN.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A. A., Rubin G.M., Hong L. , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N . A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A. , 

RA Whiting M. , Madan A. , Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J. , Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A. , Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [3] 

RP DOWN-REGULATION IN CANCER. 

RX PubMed=12 655055; DOI=10 . 1073/pnas . 0738063100; 

RA Thiebault K., Mazelin L., Pays L., Llambi F. , Joly M.-O., 

RA Scoazec J.-Y., Saurin J.-C, Romeo G., Mehlen P.; 

RT "The netrin-1 receptors UNC5H are putative tumor suppressors 

RT controlling cell death commitment."; 

RL Proc. Natl. Acad. Sci. U.S.A. 100:4173-4178(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. Also involved in corticospinal tract axon guidances 

CC independently of DCC. It also acts as a dependence receptor 

CC required for apoptosis induction when not associated with netrin 

CC ligand (By similarity) . 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; 

CC Isold=095185-1; Sequence=Displayed; 

CC Name =2 ; 

CC IsoId=095185-2; Sequence=VSP_011700, VSP_011701; 

CC -!- TISSUE SPECIFICITY: Mainly expressed in brain. Also expressed in 

CC kidney. Not expressed in developing or adult lung. 

CC -!- PTM: Phosphorylated on different cytoplasmic tyrosine residues. 

CC Phosphorylation of Tyr-568 leads to an interaction with PTPN11 

CC phosphatase, suggesting that its activity is regulated by 

CC phosphorylation/dephosphorylation . Tyrosine phosphorylation is 

CC netrin-dependent (By similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 
CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis (By similarity) . 

CC -!- MISCELLANEOUS: Down-regulated in multiple cancers including 
CC colorectal, breast, ovary, uterus, stomach, lung, or kidney 
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cancers . 

SIMILARITY: 

SIMILARITY: 

SIMILARITY: 

SIMILARITY: 

SIMILARITY: 

SIMILARITY: 



Belongs to the UNC-5 family. 
Contains 1 death domain. 

Contains 1 immunoglobulin-like C2-type domain. 
Contains 1 immunoglobulin-like domain. 
Contains 2 TSP type-1 domains. 
Contains 1 ZU5 domain. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; AF055634; AAC67491.1; -. 
EMBL; BC041156; AAH41156.1; -. 
HSSP; P07996; 1LSL. 
Genew; HGNC: 12569; UNC5C. 
MIM; 603610; -. 

GO; GO: 0005042; F:netrin receptor activity; TAS . 
GO; GO: 0007411; P:axon guidance; TAS. 
GO; GO: 0007420; P:brain development; TAS. 
InterPro; IPR000488; Death. 

DEATH_like. 
Ig-like. 
Ig_c2 . 
TSP1. 
TSP_1. 
ZU5. 

Pfam; PF00531; Death; 1 . 
Pfam; PF00047; ig; 1. 
Pfam; PF00090; TSP_1; 2. 
Pfam; PF00791; ZU5; 1. 
PRINTS; PR01705; TSP1REPEAT. 
SMART; SM00005; DEATH; 1. 
SMART; SM00408; IGc2; 1. 
SMART; SM00209; TSP1; 2. 
SMART; SM00218; ZU5; 1. 

PROSITE; PS50017; DEATH_DOMAIN ; FALSE_NEG. 
PROSITE; PS50835; IG_LIKE; 1. 
PROSITE; PS50092; TSP1; 2. 

Alternative splicing; Apoptosis; Developmental protein; 
Immunoglobulin domain; Phosphorylation; Polymorphism; Receptor; 
Repeat; Signal; Transmembrane. 



InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 



IPR011029; 
IPR007110; 
IPR003598; 
IPR000884; 
IPR008085; 
IPR000906; 
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Cytoplasmic (Potential) . 
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Ig-like. 
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Ig-like C2-type. 
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TSP type-1 1. 
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TSP type-1 2. 
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Death. 
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By similarity. 
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N-linked (GlcNAc ) (Potential) . 
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N-linked (GlcNAc ) (Potential ) 
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VARIANT 


721 
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T -> M (in dbSNP:2289043) . 


FT 








/ FTId=VAR 019732. 


FT 


CONFLICT 


• 219 


219 


T -> I (in Ref . 1) . 


FT 


CONFLICT 


489 


489 


S -> T (in Ref. 1) . 


SQ 


SEQUENCE 


931 AA; 


103101 


MW; EFD71122C98DABB8 CRC64; 



Query Match 60.3%; Score 337; DB 1; Length 931; 

Best Local Similarity 55.6%; Pred. No. 1.2e-27; 

Matches 55; Conservative 24; Mismatches 20; Indels 0; Gaps 0; 



Qy 6 YGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIV 65 

: I : I I III I : : I I : I : I I I I I I I I : I : : I I : I : I : I : I : I I : I I I I : I : I 
Db 533 FGSFNSLGGHLIVPNSGVSLLIPAGAIPQGRVYEMYVTVHRKETMRPPMDDSQTLLTPW 592 

Qy 66 SCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I I I I I I I I I : I I II : I : : I : II I : : I 
Db 593 SCGPPGALLTRPWLTMHHCADPNTEDWKILLKNQAAQG 631 



RESULT 8 
UN5B XENLA 



ID UN5B_XENLA STANDARD; PRT; 943 AA. 

AC Q8JGT4; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5B precursor (UNC-5 homolog) (Protein XUNC-5) . 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N.A. , AND TISSUE SPECIFICITY. 

RX MEDLINE=22239703; PubMed=12351179 ; DOI=10. 1016/S0925-4773 (02) 00215-0; 

RA Anderson R. B . , Holt C.E.; 

RT "Expression of UNC-5 in the developing Xenopus visual system."; 

RL Mech. Dev. 118:157-160(2002). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- TISSUE SPECIFICITY: In the developing visual system, it is 

CC expressed within the developing optic vesicles and later become 
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restricted to the dorsal ciliary marginal zone, a site of 
retinoblast proliferation and differentiation. 
PTM: Phosphorylated on cytoplasmic tyrosine residues (By 
similarity) . 

SIMILARITY: Belongs to the UNC-5 family. 
SIMILARITY: Contains 1 death domain. 

SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 
SIMILARITY: Contains 1 immunoglobulin-like domain. 
SIMILARITY: Contains 2 TSP type-1 domains. 
SIMILARITY: Contains 1 ZU5 domain. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; AY099459; AAM34486.1; -. 
HSSP; P07996; 1LSL. 
InterPro; IPR000488; 

IPR011029; 
IPR007110; 
IPR003598; 
IPR000884; TSP1. 
IPR008085; TSPl 
IPR000906; 
Pfam; PF00531; Death; 
Pfam; PF00047; ig; 1. 
Pfam; PF00090; TSP 1; 2. 



InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 



Death. 
DEATH_like. 
Ig-like . 
Ig_c2. 



ZU5. 
1. 
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Potential . 


FT 


CHAIN 


31 


943 


Netrin receptor UNC5B. 


FT 


DOMAIN 


31 


380 


Extracellular (Potential) . 


FT 


TRANSMEM 


381 


401 


Potential . 


FT 


DOMAIN 


402 


943 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


51 


148 


Ig-like. 


FT 


DOMAIN 


150 


245 


Ig-like C2-type. 


FT 


DOMAIN 


249 


303 


TSP type-1 1. 


FT' 


DOMAIN 


305 


357 


TSP type-1 2. 


FT 


DOMAIN 


540 


643 


ZU5. 


FT 


DOMAIN 


863 


941 


Death . 


FT 


DISULFID 


72 


131 


By similarity. 


FT 


DISULFID 


177 


228 


By similarity. 


FT 


CARBOHYD 


225 


225 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


350 


350 


N-linked (GlcNAc. . .) (Potential) 



SQ SEQUENCE 943 AA; 105083 MW; A024E24A7EDB6175 CRC64; 



Query Match 55.3%; Score 309; DB 1; Length 943; 

Best Local Similarity 55.8%; Pred. No. 1.3e-24; 

Matches 58; Conservative 17; Mismatches 29; Indels 0; Gaps 0; 

Qy 1 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 

: I I III I I I I I 1111:11111 I I I : I I I I : I I : : I I : II III: 
Db 540 S SNSVTGTFGSLGGRLT FPNTGVS LLI PQGAI PQGKYYEMYLMINKRENTVLP S EGTQT I 599 

Qy 61 LSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I :: I I I I : I I Mill : II : : I I : I I II : I 
Db 600 LSPIITCGPTGLLLCKPVILTVPHCADINTSDWILQLKTQSHQG 643 



RESULT 9 
UN5B_HUMAN 

ID UN5B_HUMAN STANDARD; PRT; 945 AA. 

AC Q8IZJ1; Q86SN3; Q8N1Y2; Q9H9F3; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5B precursor (Unc-5 homolog B) (Unc-5 homolog 2) 

DE (p53-regulated receptor for death and life protein 1) 

DE (UNQ1883/PR04326) . 

GN Name=UNC5B; Synonyms=P53RDLl, UNC5H2; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI__TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N . A. (ISOFORM 2), TISSUE SPECIFICITY, AND INTERACTION 

RP WITH GNAI2. 

RC TISSUE=Lung; 

RX MEDLINE=22246081; PubMed=12359238 ; DOI=10 . 1016/S0006-291X ( 02 ) 02277-5 ; 

RA Komatsuzaki K. , Dalvin S., Kinane T.B.; 

RT "Modulation of G(ialpha(2)) signaling by the axonal guidance molecule 

RT UNC5H2 . " ; 

RL Biochem. 'Biophys . Res. Comraun. 297:898-905(2002). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 1), FUNCTION, AND MUTAGENESIS OF ASP-412. 

RX PubMed-12598906; DOI=10 . 1038/ncb943; 

RA Tanikawa C, Matsuda K. , Fukuda S., Nakamura Y., Arakawa H.; 

RT "p53RDLl regulates of p53-dependent apoptosis."; 

RL Nat. Cell Biol. 5:216-223(2003). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RX MEDLINE=22887296; PubMed=12975309 ; DOI=10 . 1101/gr . 1293003 ; 

RA Clark H.F., Gurney A.L., Abaya E., Baker K., Baldwin D., Brush J., 

RA Chen J., Chow B., Chui C, Crowley C, Currell B., Deuel B., Dowd P., 

RA Eaton D. , Foster J., Grimaldi C, Gu Q. , Hass P.E., Heldens S., 

RA Huang A. , Kim H.S., Klimowski L., Jin Y., Johnson S., Lee J., 

RA Lewis L. , Liao D. , Mark M. , Robbie E., Sanchez C, Schoenf eld J., 

RA Seshagiri S., Simmons L., Singh J., Smith V., Stinson J., Vagts A. , 

RA Vandlen R. , Watanabe C, Wieand D., Woods K. , Xie M.-H., Yansura D., 

RA Yi S., Yu G., Yuan J., Zhang M., Zhang Z., Goddard A., Wood W.I., 

RA Godowski P., Gray A.; 



RT "The secreted protein discovery initiative (SPDI), a large-scale 

RT effort to identify novel human secreted and transmembrane proteins: a 

RT bioinformatics assessment."; 

RL Genome Res. 13:2265-2270(2003). 

RN [4] 

RP SEQUENCE OF 361-945 FROM N.A. 

RC TISSUE=Amygdala, and Teratocarcinoma; 

RX PubMed=14702039; DOI=10 . 1038/ngl285; 

RA Ota T., Suzuki Y. , . Nishikawa T., Otsuki T., Sugiyama T., Irie R. , 

RA Wakamatsu A. , Hayashi K., Sato H., Nagai K., Kimura K., Makita H., 

RA Sekine M. , Obayashi M. , Nishi T., Shibahara T., Tanaka T., Ishii S., 

RA Yamamoto J- -I., Saito K., Kawai Y., Isono Y., Nakamura Y., 

RA Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M. , 

RA Shiratori A., Sudo H., Hosoiri T., Kaku Y. , Kodaira H., Kondo H., 

RA Sugawara M. , Takahashi M. , Kanda K., Yokoi T., Furuya T., Kikkawa E., 

RA Omura Y., Abe K., Kamihara K., Katsuta N., Sato K. , Tanikawa M. , 

RA Yamazaki M. , Ninomiya K. , Ishibashi T., Yamashita H., Murakawa K., 

RA Fujimori K., Tanai H. , Kimata M. , Watanabe M., Hiraoka S., Chiba Y., 

RA Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M. , Hotuta T., 

RA Kusano J. , Kanehori K. , Takahashi-Fujii A. , Hara H., Tanase T.-O., 

RA Nomura Y. , Togiya S., Komai F. , Hara R. , Takeuchi K., Arita M. , 

RA Imose N. f Musashino K., Yuuki H. f Oshima A., Sasaki N., Aotsuka S., 

RA Yoshikawa Y w Matsunawa H., Ichihara T., Shiohata N-, Sano S., 

RA Moriya S. f Momiyama H., Satoh N. r Takami S., Terashima Y., Suzuki 0. f 

RA Nakagawa S. r Senoh A. , Mizoguchi H., Goto Y. f Shimizu F. , Wakebe H., 

RA Hishigaki H., Watanabe T w Sugiyama A., Takemoto M. , Kawakami B. f 

RA Yamazaki M. , Watanabe K. , Kumagai A. , Itakura S., Fukuzumi Y., 

RA Fujimori Y., Komiyama M. , Tashiro H., Tanigami A., Fujiwara T., 

RA Ono T., Yamada K. f Fujii Y. , Ozaki K., Hirao M. , Ohmori Y., 

RA Kawabata A. , Hikiji T., Kobatake N., Inagaki H. f Ikema Y., Okamoto S., 

RA Okitani R. , Kawakami T., Noguchi S., Itoh T. f Shigeta K. , Senba T. f 

RA Matsumura K w Nakajima Y. , Mizuno T. f Morinaga M. , Sasaki M. , 

RA Togashi T., Oyama M. , Hata H., Watanabe M. , Komatsu T., 

RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., 

RA Okumura K. , Nagase T. f Nomura N., Kikuchi H., Masuho Y. , Yamashita R. , 

RA Nakai K. , Yada T., Nakamura Y., Ohara 0., Isogai T., Sugano S. ; 

RT "Complete sequencing and characterization of 21,243 full-length human 

RT cDNAs . " ; 

RL Nat. Genet. 36:40-45(2004). 

RN [5] 

RP DOWN-REGULATION IN CANCER. 

RX PubMed=12655055; DOI=10 . 1073/pnas . 0738063100; 

RA Thiebault K., Mazelin L., Pays L., Llambi F. , Joly M.-O., 

RA Scoazec J.-Y., Saurin J.-C, Romeo G., Mehlen P.; 

RT "The netrin-1 receptors UNC5H are putative tumor suppressors 

RT controlling cell death commitment."; 

RL Proc. Natl. Acad. Sci. U.S.A. 100:4173-4178(2003). 

CC -!- FUNCTION: Receptor for netrin ' required for axon guidance. Mediates 

CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand. 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By 

CC similarity) . Interacts with GNAI2 via its cytoplasmic part. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 



- | . 
_ t . 



Event=Alternative splicing; Named isoforms=2; 
Name=l; 

IsoId=Q8IZ Jl-1; Sequence=Displayed; 
Name=2 ; 

IsoId=Q8IZJl-2; Sequence=VSP_011698 ; 
TISSUE SPECIFICITY: Highly expressed in brain. Also expressed at 
lower level in developing lung, cartilage, kidney and 
hematopoietic and immune tissues. 
INDUCTION: By p53/TP53. 

PTM: Phosphorylated on cytoplasmic tyrosine residues (By 
similarity) . 

PTM: Proteolytically cleaved by caspases during apoptosis . The 
cleavage does not take place when the receptor is associated with 
netrin ligand. Its cleavage by caspases is required to induce 
apoptosis . 

MISCELLANEOUS: Down-regulated in multiple cancers including 
colorectal, breast, ovary, uterus, stomach, lung, or kidney 



cancers . 

SIMILARITY: 

SIMILARITY: 

SIMILARITY: 

SIMILARITY: 

SIMILARITY: 

SIMILARITY: 



Belongs to the UNC-5 family. 
Contains 1 death domain. 

Contains 1 immunoglobulin-like C2-type domain. 
Contains 1 immunoglobulin-like domain. 
Contains 2 TSP type-1 domains. 
Contains 1 ZU5 domain. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL; AY126437; AAM95701.1; -. 

EMBL; AB096256; BAC57998.1; 

EMBL; AY358351; AAQ88717.1; 

EMBL; AK022859; BAB14276.1; ALT_INIT. 

EMBL; AK094595; BAC04382.1; ALT_INIT. 

HSSP; P07996; 1LSL. 

Genew; HGNC: 12568; UNC5B. 

MIM; 607870; 

InterPro; IPR000488; Death. 
InterPro; IPR011029; DEATH_like. 
InterPro; IPR007110; Ig-like. 
InterPro; IPR003598; Ig_c2. 
InterPro; IPR000884; TSP1. 
InterPro; IPR008085; TSP_1. 
InterPro; IPR000906; ZU5. 
Pfam; PF00531; Death; 1. 
Pfam; PF00047; ig; 1. 
Pfam; PF00090; TSP_1; 2. 
Pfam; PF00791; ZU5; 1. 
PRINTS; PR01705; TSP1REPEAT. 
SMART; SM00005; DEATH; 1. 
SMART; SM00408; IGc2; 1. 
SMART; SM00209; TSP1; 2. 
SMART; SM00218; ZU5; 1. 



L/i\ 


PROSITE; 


PS50017; 


DEATH 


DOMAIN; FALSE NEG. 


no 


PROSITE; 


PS50835; 


IG LIKE; 


1. 


no 
UK 


PROSITE; 


PS50092; 


TSP1; 


2. 




VTaT 


Alternative splicing; Apoptosis; Developmental protein; 




Immunoglobulin domain; 


Phosphorylation; Polymorphism; Receptor; 




Repeat; Signal; Transmembrane. 


FT 




1 


z o 




potential . 


FT 


r-u 7\ T XT 








jMemn receptor unldd. 


FT 


DOMAIN 


£ 1 


Off 




cjXuracexxujLar lirotenuiax; . 


FT 


m t*\ % t bikini/ 

TRANS MEM 


1 'I o 

378 


o n o 

398 




Potential . 


FT 


DOMAIN 


399 


945 




Cytoplasmic (Potential) . 


FT 


DOMAIN 


48 


145 




Ig-like . 


FT 


DOMAIN 


147 


242 




Ig-lxke C2-type. 


FT 


DOMAIN 


246 


300 




TSP type-1 1. 


FT 


DOMAIN 


302 


354 




TSP type-1 2. 


FT 


DOMAIN 


541 


644 




ZU5 . 


FT 


DOMAIN 


o r c 

865 


943 




Death . 


FT 


SITE 


/I T O 

412 


4 1 J 




Cleavage (by caspase-3) . 


FT 


SITE 


707 


725 




Interaction with DCC (By similarity) . 


FT 


DISULFID 


69 


128 




By similarity. 


FT 


DISULFID 


174 


225 




By similarity. 


FT 


CARBOHYD 


222 


222 




N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


347 


347 




N-linked (GlcNAc. . .) (Potential). 


FT 


VARSPLIC 


356 


367 




NKKTLSDPNSHL -> M (in isoform 2) . 


FT 










/FTId=VSP_011698 . 


FT 


VARIANT 


516 


516 




A -> T (in dbSNP: 10509332) . 


FT 










/ FTId=VAR_0 19730 . 


FT 


MUTAGEN 


412 


412 




D->N: Abolishes cleavage by caspase-3 and 


FT 










subsequent induction of apoptosis. 


FT 


CONFLICT 


483 


483 




K -> E (in Ref . 3) . 


FT 


CONFLICT 


851 


851 




L -> P (in Ref. 3; BAB14276) . 


SQ 


SEQUENCE 


945 AA; 


103637 


MW; 56064E335F323447 CRC64; 



Query Match ' 51.3%; Score 287; DB 1; Length 945; 

Best Local Similarity 58.6%; Pred. No. 3.3e-22; 

Matches 58; Conservative 13; Mismatches 26; Indels 2; Gaps 2; 



Qy 7 GTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLA-GCQTLLSPIV 65 

Ml I III I I I I 1:111:1 I I 1:1 I I 1:1 I ::| I III: I Ihlll I 
Db 547 GTFGCLGGRLSIPGTGVSLLVPNGAIPQGKFYEMYLLINKAEST-LPLSEGTQTVLSPSV 605 

Qy 66 SCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

: I I I I : I I I I I I I I I I I I I : I I I : : I 
Db 606 TCGPTGLLLCRPVI LTMPHCAEVSARDWI FQLKTQAHQG 644 



RESULT 10 
UN5B_RAT 

ID UN5B_RAT STANDARD; PRT; 945 AA. 

AC 008722; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5B precursor (Unc-5 homolog B) (Unc-5 homolog 2) . 
GN Name=Unc5b; Synonyms=Unc5h2 ; 
OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 



OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus . 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A., FUNCTION, SUBCELLULAR LOCATION, AND TISSUE 

RP SPECIFICITY. 

RX MEDLINE=97271897; PubMed=9126742 ; 

RA Leonardo E.D., Hinck L., Masu M. , Keino-Masu K. , Ackerman S.L., 

RA Tessier-Lavigne M. ; 

RT "Vertebrate homologues of C. elegans UNC-5 are candidate netrin 

RT receptors."; 

RL Nature 386:833-838(1997). 

RN [2] 

RP FUNCTION, AND INTERACTION WITH DCC. 

RX PubMed=10399920; 

RA Hong K., Hinck L., Nishiyama M., Poo M.-M., Tessier-Lavigne M. , 

RA Stein E. ; 

RT "A ligand-gated association between cytoplasmic domains of UNC5 and 

RT DCC family receptors converts netrin-induced growth cone attraction to 

RT repulsion."; 

RL Cell 97:927-941(1999). 

RN [3] 

RP FUNCTION, AND MUTAGENESIS OF ASP- 4 12. 

RX PubMed=11387206; DOI=10 . 1093/emboj /20 . 11 . 2715 ; 

RA Llambi F. , Causeret F. , Bloch-Gallego E., Mehlen P.; 

RT "Netrin-1 acts as a survival factor via its receptors UNC5H and DCC"; 

RL EMBO J. 20:2715-2722(2001). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand. 

CC -!- SUBUNIT: Interacts with GNAI2 via its cytoplasmic part (By 
CC similarity) . Interacts with the cytoplasmic part of DCC. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- TISSUE SPECIFICITY: Mainly expressed in regions of differentiating 
CC neurons. Expressed in the developing sensory ganglia that flank 

CC the spinal cord from E12, peaking at E14. Expressed in the roof 

CC plate region of the spinal cord from E14. 

CC PTM: Phosphorylated on cytoplasmic tyrosine residues (By 

CC similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 
CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis. 

, CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 



CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U87306; AAB57679.1; -. 

DR HSSP; P07996; 1LSL. 

DR RGD; 621756; Unc5h2 . 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5. 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TS PI REPEAT . 



DR 


SMART; SM00005; 


DEATH; 1. 




DR 


SMART; SM00408; 


IGc2; 1. 




DR 


SMART; SM00209; 


TSP1; 2. 




DR 


SMART; SM00218; 


ZU5; 1. 




DR 


PROSITE; 


PS50017; DEATH DOMAIN; 1. 


DR 


PROSITE; 


PS50835; IG_LIKE; 


1. 


DR 


PROSITE; 


PS50092; TSP1; 2. 




KW 


Apoptosis 


; Developmental protein; Immunoglobulin domain; 


KW 


Phosphorylation; 


Receptor; 


Repeat; Signal; Transmembrane. 


FT 


SIGNAL 


1 


26 


Potential . 


FT 


CHAIN 


27 


945 


Netrin receptor UNC5B. 


FT 


DOMAIN 


27 


377 


Extracellular (Potential) . 


FT 


TRANSMEM 


378 


398 


Potential . 


FT 


DOMAIN 


399 


945 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


48 


145 


Ig-like . 


FT 


DOMAIN 


153 


242 


Ig-like C2-type. 


FT 


DOMAIN 


246 


300 


TSP type-1 1. 


FT 


DOMAIN 


302 


354 


TSP type-1 2. 


FT 


DOMAIN 


541 


644 


ZU5. 


FT 


DOMAIN 


865 


943 


Death. 


FT 


SITE 


412 


413 


Cleavage (by caspase-3) . 


FT 


SITE 


707 


725 


Interaction with DCC. 


FT 


DISULFID 


69 


128 


By similarity. 


FT 


DISULFID 


174 


225 


By similarity. 


FT 


CARBOHYD 


222 


222 


N-linked (GlcNAc. . .-) (Potential) . 


FT 


CARBOHYD 


347 


347 


N-linked (GlcNAc. . .) (Potential). 


FT 


MUTAGEN 


412 


412 


D->N: Abolishes cleavage by caspase-3 and 


FT 








subsequent induction of apoptosis. 


SQ 


SEQUENCE 


945 AA; 103520 


MW; 6E9C2A262E560B9B CRC64; 



Query Match 4 9.6%; Score 277; DB 1; Length 945; 

Best Local Similarity 52.9%; Pred. No. 4.1e-21; 

Matches 55; Conservative 17; Mismatches 30; Indels 



2; Gaps 



2; 



Qy 

Db 



2 SNMTYGTFNFLGGRLMrPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLA-GCQTL 60 

I : III I I I I I I I I I : I I I : I I I I : I I I : : I I : : I I I I I : I I I : 
542 SSSVSGTFGCLGGRLTIPGTGVSLLVPNGAIPQGKFYDLYLRINKTEST-LPLSEGSQTV 600 



Qy 



61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 



I I I I : I I I I : I I I I I : I : I I I I : I I I : : I 

Db 601 LSPSVTCGPTGLLLCRPWLTVPHCAEVIAGDWIFQLKTQAHQG 644 



RESULT 11 
UN5BJVIOUSE 

ID UN5B_MOUSE STANDARD; PRT; 945 AA. 

AC Q8K1S3; Q6PFH0; Q80Y85; Q9D398; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5B precursor (Unc-5 homolog B) (Unc-5 homolog 2) . 

GN Name=Unc5b; Synonyms =Unc5h2 ; 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1), AND TISSUE SPECIFICITY. 

RX MEDLINE=22239710; PubMed=12351186; DOI=10 . 1016/S0925-4773 ( 02 ) 00248-4 ; 

RA Engelkamp D.; 

RT "Cloning of three mouse unc-5 genes and their expression patterns at 

RT mid-gestation . " ; 

RL Mech. Dev. 118:191-197(2002). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 1) . 

RC STRAIN=C57BL/6J; TISSUE=Medulla oblongata; 

RX MEDLINE=22354683; PubMed=124 66851; DOI=10 . 1038/nature01266; 

RA Okazaki Y., Furuno M., Kasukawa T., Adachi.J., Bono H., Kondo S., 

RA Nikaido I., Osato N., Saito R. , Suzuki H., Yamanaka I., Kiyosawa H., 

RA Yagi K. , Tomaru Y., Hasegawa Y., Nogami A., Schonbach C, Gojobori T., 

RA Baldarelli R. , Hill D.P., Bult C, Hume D.A. , Quackenbush J. , 

RA Schriml L.M. , Kanapin A., Matsuda H., Batalov S., Beisel K.W., 

RA Blake J. A., Bradt D., Brusic V., Chothia C, Corbani L.E., Cousins S., 

RA Dalla E. , Dragani T.A., Fletcher C.F., Forrest A. , Frazer K.S., 

RA Gaasterland T., Gariboldi M. , Gissi C, Godzik A., Gough J . , 

RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D., 

RA Kanai A. , Kawaji H. , Kawasawa Y., Kedzierski R.M., King B.L., 

RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P. A., 

RA Maglott D.R., Maltais L., Marchionni L. , McKenzie L., Miki H., 

RA Nagashima T., Numata K. , Okido T., Pavan W.J., Pertea G. f Pesole G., 

RA Petrovsky N., Pillai R. , Pontius J.U., Qi D., Ramachandran S. f 

RA Ravasi T., Reed J.C., Reed D.J. f Reid J., Ring B.Z., Ringwald M. , 

RA Sandelin A., Schneider C, Semple C.A., Setou M. , Shimada K. , 

RA Sultana R. , Takenaka Y-, Taylor M.S., Teasdale R.D., Tomita M. , 

RA Verardo R. , Wagner L., Wahlestedt C, Wang Y., Watanabe Y. , Wells C, 

RA Wilming L.G., Wynshaw-Boris A., Yanagisawa M., Yang I., Yang L., 

RA Yuan Z., Zavolan M., Zhu Y., Zimmer A., Carninci P., Hayatsu N., 

RA Hirozane-Kishikawa T., Konno H., Nakamura M. , Sakazume N . , Sato K. , 

RA Shiraki T., Waki K., Kawai J., Aizawa K., Arakawa T., Fukuda S., 

RA Hara A., Hashizume W., Imotani K. , Ishii Y., Itoh M. , Kagawa I:, 

RA Miyazaki A., Sakai K., Sasaki D., Shibata K., Shinagawa A., 

RA Yasunishi A. , Yoshino M. , Waters ton R. , Lander E.S., Rogers J., 

RA Birney E., Hayashizaki Y.; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573 (2002) . 



RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC STRAIN=C57BL/6; TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas . 242603899; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R. D. , Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. , Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M. B. , Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M. , Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A. , 

RA Whiting M., Madan A. , Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A. , Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [4] 

RP TISSUE SPECIFICITY. 

RX PubMed=12799072; 

RA Dalvin S., Anselmo M.A., Prodhan P., Komatsuzaki K., Schnitzer J. J., 

RA Kinane T.B. ; 

RT "Expression of Netrin-1 and its two receptors DCC and UNC5H2 in the 

RT developing mouse lung."; 

RL Gene Expr. Patterns 3:279-283(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand (By 

CC similarity) . 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. Interacts 
CC with GNAI2 via its cytoplasmic part (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; 

CC IsoId=Q8KlS3-l; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=Q8KlS3-2; Sequence=VSP_011699 ; 

CC -!- TISSUE SPECIFICITY: Highly expressed in brain. Expressed in lung 
CC during late development. Expressed during early blood vessel 

CC formation, in the semicircular canal and in a dorsal to ventral 

CC gradient in the retina. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 
CC similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 
CC. cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 
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apoptosis (By similarity) 



SIMILARITY: 
SIMILARITY: 

SIMILARITY: Contains 

SIMILARITY: Contains 

SIMILARITY: Contains 

SIMILARITY: Contains 



Belongs to the UNC-5 family. 
Contains 1 death domain. 

1 immunoglobulin-like C2-type domain. 

1 immunoglobulin-like domain. 

2 TSP type-1 domains. 
1 ZU5 domain. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib.ch). 

EMBL; AJ487853; CAD32251.1; 

EMBL; AK018177; BAB31108.1; 

EMBL; BC048162; AAH48162.1; ALT_INIT. 

EMBL; BC057560; AAH57560.1; 

HSSP; P07996; 1LSL. 

MGD; MGI: 894703; Unc5b. 

InterPro; IPR000488; Death. 

InterPro; IPR011029; DEATH_like. 

InterPro; IPR007110; Ig-like. 

InterPro; IPR003598; Ig_c2 . 

InterPro; IPR000884; TSP1. 

InterPro; IPR008085; TSP_1. 

InterPro; IPR000906; ZU5. 

Pfam; PF00531; Death; 1. 

Pfam; PF00047; ig; 1. 

Pfam; PF00090; TSP_1; 2. 

Pfam; PF00791; ZU5; 1. 

PRINTS; PR01705; TSP1REPEAT. 

SMART; SM00005; DEATH; 1. 

SMART; SM00408; IGc2; 1. 

SMART; SM00209; TSP1; 2. 

SMART; SM00218; ZU5; 1. 

PROSITE; PS50017; DEATH_DOMAIN ; 1. 

PROSITE; PS50835; IG_LIKE; 1. 

PROSITE; PS50092; TSP1; 2. 

Alternative splicing; Apoptosis; Developmental protein; 
Immunoglobulin domain; Phosphorylation; Receptor; Repeat; Signal; 
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Potential . 
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Netrin receptor UNC5B. 
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Extracellular (Potential) . 
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Potential . 


FT 
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945 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


48 
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Ig-like. 


FT 


DOMAIN 
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Ig-like C2-type. 
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TSP type-1 1. 
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DOMAIN 
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354 


TSP type-1 2. 
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DOMAIN 


541 


644 


ZU5. 


FT 


DOMAIN 


865 


943 


Death. 


FT 


SITE 


412 


413 


Cleavage (by caspase-3) (By similarity) . 
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SITE 
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725 


Interaction with DCC (By similarity) . 
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/FTId=VSP 011699. 




FT 


CONFLICT 


238 


238 


T -> A (in Ref . 2) 




FT 


CONFLICT 


394 


394 


V -> E (in Ref. 2) 




FT 


CONFLICT 


679 


679 


T -> S (in Ref. 2) 




FT 


CONFLICT 


874 


874 


N -> D (in Ref. 2) 




SQ 


SEQUENCE 


945 AA; 


103738 


MW;- 80E896FOFOE06012 CRC64; 



Query Match 49.0%; Score 274; DB 1; Length 945; 

Best Local Similarity 51.9%; Pred. No. 8.6e-21; 

Matches 54; Conservative 18; Mismatches 30; Indels 2; Gaps 2; 

Qy 2 SNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLA-GCQTL 60 

I : III I II I I : I I I : I I I : I I I I : I I I : : I I : : I I I I I : I I I : 
Db 542 SSSVSGTFGCLGGRLSLPGTGVSLLVPNGAIPQGKFYDLYLHINKAEST-LPLSEGSQTV 600 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I : I I I I : I I I I I : I : I I I I : I I I : : I 

Db 601 LS PSVTCGPTGLLLCRPVVLTVPHCAEVI AGDWI FQLKTQAHQG 644 



RESULT 12 
UN5D MOUSE 



,ID UN5D__MOUSE STANDARD; PRT; 956 AA. 

AC Q8K1S2; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5D precursor (Unc-5 homolog D) (Unc-5 homolog 4) . 

GN Name=Unc5d; Synonyms=Unc5h4 ; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFIICTY. 

RX MEDLINE=22239710; PubMed=12351186; DOI=10 . 1016/S0925-4773 ( 02 ) 00248-4 ; 

RA Engelkamp D.; 

RT "Cloning of three mouse Unc5 genes and their expression patterns at 

RT mid-gestation."; 

RL Mech. Dev. 118:191-197(2002). 

CC -!- FUNCTION: Receptor for netrin involved in cell migration. May be 
CC involved in axon guidance by mediating axon repulsion of neuronal 

CC growth cones in the developing nervous system upon ligand binding. 

CC Axon repulsion in growth cones may be caused by its association 

CC with DCC that may trigger signaling for repulsion. It also acts as 

CC a dependence receptor required for apoptosis induction when not 

CC associated with netrin ligand (By similarity) . 

CC - ! - SUBUNIT: Interacts with the cytoplasmic part of DCC (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- TISSUE SPECIFICITY: Expressed in developing limb and mammary 
CC gland. 



CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 
CC similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 

CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis (By similarity) . 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AJ487854; CAD32252.1; -. 

DR HSSP; P07996; 1LSL. 

DR MGD; MGI: 2389364; UncSd. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5. 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TSP1REPEAT . 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00209; TSP1; 2. 

DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG. 

DR PROSITE; PS50835; IG_LIKE; 1. 

DR PROSITE; PS50092; TSP1; 2. 

KW Apoptosis; Developmental protein; Immunoglobulin domain; 

KW Phosphorylation; Receptor; Repeat; Signal; Transmembrane. 
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Potential . 
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Netrin receptor UNCSD. 
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Extracellular (Potential) . 
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Potential . 
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Cytoplasmic (Potential) . 


FT 


DOMAIN 


52 


149 


Ig-like. 


FT 
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151 


242 


Ig-like C2-type. 
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DOMAIN 
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304 


TSP type-1 1. 


FT 


DOMAIN 
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358 


TSP type-1 2. 


FT 


DOMAIN 


543 


645 


ZU5. 


FT 


DOMAIN 


862 


939 


Death. 


FT 


SITE 


419 


420 


Cleavage (by caspase-3) (By similarity) . 


FT 


SITE 


706 


724 


Interaction with DCC (By similarity) . 



FT 


DISULFID 


73 


132 


By similarity. 




FT 


DISULFID 


178 


229 


By similarity. 
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CARBOHYD 
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115 


N-linked (GlcNAc. . 


.) (Potential). 


FT 
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FT 
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N-linked (GlcNAc. . 


. ) (Potential) . 


FT 


CARBOHYD 


379 


379 


N-linked (GlcNAc. . 


.) (Potential). 


SQ 


SEQUENCE 


956 AA; 


106351 


MW; DFDF07839C10C68D 


CRC64; 


Query Match 




43.9%; 


Score 245.5; DB 1; 


Length 956; 



Best Local Similarity 46.2%; Pred. No. l.le-17; 

Matches 48; Conservative 21;^ Mismatches 34; Indels 1; Gaps 1; 

Qy 1 TSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTL 60 

I III I I I I I : : I I I I : I I I I I III : I | I : : : : : I I I : I 

Db 543 TELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVL 601 

Qy 61 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEG 104 

II I I : I I I I : I : I I I : I I : I : I : : I I I : : : I 

Db 602 LSPEVTCGPPDMLVTTPFALTIPHCADVSSEHWNIHLKKRTQQG 645 



RESULT 13 
UN5D_HUMAN 

ID UN5D_HUMAN STANDARD; PRT; 953 AA. 

AC Q6UXZ4; Q8WYP7; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5D precursor (Unc-5 homolog D) (Unc-5 homolog 4) 

DE (UNQ6012/PRO34692) . 

GN Name=UNC5D; Synonyms=KIAA1777, UNC5H4; 

OS Homo sapiens (Human) . . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RX MEDLINE=22887296; PubMed=12975309; DOI=10 . 1101/gr . 1293003; 

RA Clark H.F., Gurney A.L., Abaya E., Baker K., Baldwin D., Brush J., 

RA Chen J., Chow B. , Chui C, Crowley C, Currell B., Deuel B.,. Dowd P., 

RA Eaton D., Foster J., Grimaldi C, Gu Q., Hass P.E., Heldens S., 

RA Huang A., Kim H.S., Klimowski L. , Jin Y., Johnson S., Lee J., 

RA Lewis L., Liao D., Mark M. , Robbie E., Sanchez C, Schoenfeld J., 

RA Seshagiri S., Simmons L., Singh J., Smith V., Stinson J., Vagts A. , 

RA Vandlen R. , Watanabe C, Wieand D., Woods K. , Xie M.-H., Yansura D., 

RA Yi S., Yu G. f Yuan J. f Zhang M. , Zhang Z., Goddard A., Wood W.I., 

RA Godowski P., Gray A. ; 

RT "The secreted protein discovery initiative (SPDI) , a large-scale 

RT effort to identify novel human secreted and transmembrane proteins: a 

RT bioinformatics assessment."; 

RL Genome Res. 13:2265-2270(2003). 

RN [2] 

RP SEQUENCE" FROM N.A. (ISOFORM 2) . 

RC TISSUE=Brain; 

RA Nakajima D., Nakayama M. , Nagase T. f Ohara O. ; 

RT "Identification of unc5H4 gene."; 

RL Submitted (JAN-2001) to the EMBL/ GenBank/DDBJ databases. 



CC -!- FUNCTION: Receptor for netrin. May be involved in axon guidance by 

CC mediating axon repulsion of neuronal growth cones in the 

CC developing nervous system upon ligand binding- Axon repulsion in 

CC growth cones may be caused by its association with DCC that may 

CC trigger signaling for repulsion. It also acts as a dependence 

CC receptor required for apoptosis induction when not associated with 

CC netrin ligand (By similarity) . 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l ; 

CC IsoId=Q6UXZ4-l; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=Q6UXZ4-2; Sequence=VSP_011703; 

CC , Note=No experimental confirmation available; 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 

CC similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 

CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis (By similarity) . 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 
CC SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC — — 

DR EMBL; AB055056; BAB83663.1; -. 

DR EMBL; AY358147; AAQ88514.1; 

DR HSSP; P07996; 1LSL. 

DR Genew; HGNC: 18634; UNC5D. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5 . 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TSP1REPEAT. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00209; TSP1; 2. 
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Alternative splicing; Apoptosis; Developmental protein; 
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Immunoglobulin domain; 


Phosphorylation; Receptor; Repeat; Signal; 
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Cytoplasmic (Potential) . 
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Ig-like C2-type. 
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TSP type-1 1. 
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TSP type-1 2. 
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ZU5. 
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Death. 
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Cleavage (by caspase-3) (By similarity) . 
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Interaction with DCC (By similarity) . 
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By similarity. 
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By similarity. 
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/FTId=VSP 011703. 


SQ 


SEQUENCE 


953 AA; 


105879 


MW; 5F893B9DF746F731 CRC64 ; 


Query Match 




43 


.6%; 


Score 243.5; DB 1; Length 953; 



Best Local Similarity 45.2%; Pred. No. 1.8e-17; 

Matches 47; Conservative 22; Mismatches 34; Indels 1; Gaps 1; 



Qy 1 TSNMT YGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 60 



Db 540 TELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVL 598 

Qy 61 LSPIVSCGPPGVXLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 104 

I I I I : I I I I : : : I I I : I I : - I : I : : I I I : : : I 
Db 599 LSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRTQQG 642 



RESULT 14 
Q68DX9 

ID Q68DX9 PRELIMINARY; PRT; 1267 AA. 

AC Q68DX9; 

DT 25-OCT-2004 (TrEMBLrel. 28, Created) 

DT 25-OCT-2004 (TrEMBLrel. 28, Last sequence update) 

DT 25-OCT-2004 (TrEMBLrel. 28, Last annotation update) 

DE Hypothetical protein DKFZp686M05161 . 

GN Name=DKFZp686M05161; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI TaxID=9606; 



RN 
RP 
RC 
RG 
RA 
RA 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
SQ 



[1] 

SEQUENCE FROM N.A. 
TISSUE=Uterus endothel; 
The German cDNA Consortium; 

Poustka A., Albert R. , Moosmayer P., Schupp I., Wellenreuther R. , 
Mewes H.W., Weil B., Amid C, Osanger A. , Fobo G., Han M. , Wiemann S. 
Submitted (AUG-2004) to the EMBL/GenBank/DDBJ databases. 
EMBL; CR749235; CAH18091.1; -. 
InterPro; IPR008144; Guanylate_kin. 

IPR008145; Guanylt/Ca. 
IPR001478; PDZ . 
IPR001452; SH3. 
IPR011511; SH3_2. 
IPR005417; ZonOcculdens . 
IPR005418; ZonOcculSl. 
IPR000906; ZU5 . 

Guanylate_kin; 1. 
PDZ; 1. 
SH3_1; 1. 
SH3_2 ; 1 . 
ZU5; 1. 

PRINTS; PR01597; ZONOCCLUDNS. 
PRINTS; PR01598; Z0N0CCLUDNS1 . 
SMART; SM00072; GuKc; 1. 
SMART; SM00228; PDZ; 1. 
SMART; SM00218; ZU5 ; 1. 

PROSITE; PS50052; GUANYLATE_KINASE_2 ; 1. 
PROSITE; PS50106; PDZ; 1. 
PROSITE; PS50002; SH3; 1. 
Hypothetical protein. 

SEQUENCE 1267 AA; 142571 MW; 734B3877FBCAC6FA CRC64; 



InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 
Pfam; PF00625; 
Pfam; PF00595; 
PF00018; 
PF07653; 
PF00791; 



Pfam; 
Pfam; 
Pfam; 



Query Match 34.3%; Score 192; DB 2; Length 1267, 

Best Local Similarity 43.6%; Pred. No. le-11; 

Matches 41; Conservative 12; Mismatches 39; Indels 



2; Gaps 



1; 



Qy 7 GTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPL— AGCQTLLSPI 64 

III III I I : I : : I I I I I I Ml : : : II : I I I I I : 

Db 1137 GI FNSNGGVLSS I ETGVS III PQGAI PEGVEQEI YFKVCRDNS I LPPLDKEKGETLLS PL 1196 

Qy 65 VSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLK 98 

Mill: : I I I : I I Mill II 
Db 1197 VMCGPHGLKFLKPVELRLPHCASMT PDGWS FALK 1230 



RESULT 
097758 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 



15 



OC 
OC 



PRELIMINARY; 



PRT; 1769 AA. 



Created) 

Last sequence update) 
Last annotation update) 



097758 
097758; 

01-MAY-1999 (TrEMBLrel. 10 , 
01-MAY-1999 (TrEMBLrel. 10, 
01-MAR-2004 (TrEMBLrel. 26, 
ZO-1 MDCK. 
Name=Z01-MDCK; 
Canis familiaris (Dog) . 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; 



Euteleostomi; 
Canis . 



1 



OX NCBI_TaxID=9615; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=breed: Cocker Spaniel; 

RX MEDLINE=99196918; PubMed=10094817 ; DOI=10 . 1006/excr . 1999 . 4392 ; 

RA Gonzalez-Mariscal L., Islas S., Contreras R.G., Garcia-Villegas M.R., 

RA Betanzos A. , Vega J., Diaz-Quinonez A., Martin-Orozco N., 

RA Ortiz-Navarrete V., Cereijido M., Valdes J.; 

RT "Molecular characterization of the tight junction protein ZO-1 in MDCK 

RT cells."; 

RL Exp. Cell Res. 248:97-109(1999). 

CC -!- SIMILARITY: Contains 3 PDZ/DHR domains. 

DR EMBL; U55935; AAD11529. 1; -. 

DR HSSP; P31016; 1JXO. 

DR GO; GO: 0005923; C: tight junction; IEA. 

DR GO; GO: 0005515; F:protein binding; IEA. 

DR InterPro; IPR008144; Guanylatejcin . 

DR InterPro; IPR008145; Guanylt/Ca. 

DR InterPro; IPR001478; PDZ . 

DR InterPro; IPR001452; SH3 . 

DR InterPro; IPR011511; SH3_2 . 

DR InterPro; IPR005417; ZonOcculdens . 

DR InterPro; IPR005418; ZonOcculSl. 

DR InterPro; IPR000906; ZU5. 

DR Pfam; PF00625; Guanylate_kin; 1. 

DR Pfam; PF00595; PDZ; 3. 

DR Pfam; PF07653; SH3_2; 1. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PRO 1597; ZONOCCLUDNS. 

DR PRINTS; PR01598; ZONOCCLUDNS 1 . 

DR SMART; SM00072; GuKc; 1. 

DR SMART; SM00228; PDZ; 3. 

DR SMART; SM00326; SH3; 1. 

DR SMART; SM00218; ZU5; 1. 

DR PROSITE; PS50052; GUANYLATE_KINASE_2 ; 1. 

DR PROSITE; PS50106; PDZ; 3. 

DR PROSITE; PS50002; SH3; 1. 

SQ SEQUENCE 1769 AA; 197606 MW; 181E9F36CEBC96EF CRC64; 

Query Match 34.3%; Score 192; DB 2; Length 1769; 

Best Local Similarity 43.6%; Pred. No. 1.5e-ll; 

Matches 41; Conservative 12; Mismatches 39; Indels 2; Gaps 1; 

Qy 7 GTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPL — AGCQTLLSPI 64 

III III I I : I : : I I I I I I III : : : I I : I I I I I : 

Db 1639 GVFNNNGGVLSSIETGVSIIIPQGAIPEGVEQEIYFKVCRDNSILPPLDKEKGETLLSPL 1698 

Qy 65 VSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLK 98 

Mill: : I I I : I I Mill II 
Db 1699 VMCGPHGLKFLKPVELRLPHCASMT PDGWS FALK 1732 



Search completed: March 1, 2005, 09:03:41 
Job time : 21.025 sees 



